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■TOW PUnrnm Tnmnr rr 'isajiBatj^^ 

Field of ^ Tnv.p >j,-. n 

5 This Mention relates generally to the field of 

proteins, protein analogs, and methods ox specifics! 
altering proteins in order to change their biolo^aT 
activity. „ore particularly, the invention relltes^ to 
identifying the active site of a protein, for example 
10 Protease »exi»-l, and altering the actlve 

changing one or more a-ino acids therein to create an 
analog or replacing the active sit. „<• . / 
active site of a cLpletelyZ^t pXn ^ ^ 
Chimeric protein. The invention also relates to ^ " 

1T2 "! CySt6lTC reSidUeS ' ° r «" «=«ues 

ao ititT al^tT^ ^ Cmine Wlth ° Ut 

activity, and attaching polyethylene glycol to the thio 

*roup of cysteine, thereby increasing protein sZiuZ. 

BackornnrH ^ fh& Jnwn + im 
20 It has been known for some time that • ^ 

specific functions in the body. For exa n pl e natural 
Physiological functions such as tissue reaodelinT 
inflammation, coagulation, and fibrinolysis require 

« ZT'T SnZyTOS - °' Par " CUlar »PO"ance^s a 
« mechanistic class of proteases called serine proteases 
" is a!s„ k n„ m that the active site of all funct"^ 
•embers of the serine protease family contains T 
characteristic catalytic triad consisting of serine 

30 T ' ' BSPartlC aCld Md "istidine. The 

30 hydroxy! group of the catalytic site serine is involved 
in a nucleophilic attacx on the carbonyl carbon of IT 
Peptide bond to be „ydroly 2 .d resulting i„ acyLLn of 
the protease and hydrolysis of the peptide boL. T^is is 
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10 



followed rapidly by a deacylation step resulting in the 
release of intact protease. 

In order to provide examples of the present 
invention the inventors have focused on Protease nexin-i 
(PN-l) which is a serine protein purified from serum-free 
medium conditioned by human foreskin cells (Scott, R.w. 
et al., J ^io^ Chem (1983) 5fi: 1043910444) . It is'a 42 kd 
glycoprotein which is released by fibroblasts, myotubes, 
heart muscle cells, and vascular smooth muscle cells. 
Its release, along with that of plasminogen activator, is 
stimulated by phorbol esters and by mitogens (Eaton, D.L. 
et al., J Cell Biol (1983) 123:128). Native PN-l is an 
approximately 400 amino acid protein containing about 10% 
carbohydrate, since it is present only in trace levels 
15 in serum, it apparently functions at or near the surfaces 
of interstitial cells. PN-l inhibits all the known 
activators of urokinase proenzyme, plasmin, trypsin, 
thrombin, and Factor Xa (Eaton, D.L. et al., J Biol ch.. 
(1984) 159:6241). it also inhibits tissue plasminogen 
20 activator and urokinase. However, PN-l does not inhibit 
elastase or cathepsin G. 

In our earlier application now u. s. Patent 
5,187,089 we noted that the reactive site region of PN-l 
acts as a substrate analogue and postulated that it might 
25 be possible to drastically alter PN-l activity by 
modifying the reactive site sequence of PN-l, thus 
changing its protease specificity. Similar efforts with 
or-l-antitrypsin, for example, resulted in variants with 
altered and therapeutic potential (M. Courtney et al., 
30 Nature (1985) 113:149-151). PN-l is different from most 
serpins in that it is found in tissues, contains a high 
affinity heparin binding site which localizes it to 
tissues, and has a tissue clearance receptor that is 
responsible for endocytosis of protease-PN-1 complexes. 
We were able to generate PN-l variants as inhibitors of 



35 
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physiologic proteases such as lastase and thereby 
provide useful pharmaceutical^ active compounds. 

We have now invented a number of additional PN-i 
variants, including variants which inactivate elastase 
5 and cathepsin G, and have moved well beyond our prior 
work to provide variants and methods for designing and 
producing such variants which have significantly altered 
protease specificity and second-order association rate 
constants with respect to a variety of serine proteases. 
10 In addition, we provide variants and methods of producing 
such variants which have polyethylene glycol specifically 
attached to one or more cysteine residues, such cysteine 
residues being either present in the parent molecule or 
introduced on the surface of the protein by site-directed 
15 mutagenesis, and methods for determining appropriate 
sites for the introduction of cysteine residues, m 
addition, we provide variants and methods of producing 
such variants which combine the specific localization 
ability of a receptor-binding protein with the protease- 
20 inhibiting activity of PN-1 or variant thereof, resulting 
m desired biological activities with particular 
substrates . 

Summary of i- he Invention 

Chimeric proteins, also referred to herein as 

25 variants, and methods of producing, formulating and 
utilizing such proteins are disclosed. Five different 
general types of variants are disclosed. A Type I 
variant of the invention is produced by site-directed 
mutagenesis wherein a single amino acid within the active 

30 site of PN-l is substituted with an amino acid different 
from the naturally-occurring amino acid at that position. 
A Type II variant of the invention is similar to a Type I 
variant in that the active site of pn-1 is changed. 
However, to produce a Type II variant, the active site of 
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PN-l is changed so as to match the active site of another 
serpin which change many require one or more amino acid 
substitutions, deletions or additions. A Type III 
variant of the invention is produced whereby the active 
5 site or a portion thereof of PN-l is substituted with a 
sequence which corresponds to the substrate sequence for 
a particular protease. A Type IV variant is produced 
wherein a cysteine residue, which is either present in 
the native protein or introduced by site-specific 
10 mutation, is used to attach polyethylene glycol, a 
Type v variant of the invention involves producing a 
fusion protein wherein PN-l is fused to the receptor 
binding region of another protein in order to localize 
PN-l to a different receptor. Variants of Type I, II or 
15 in are referred to herein as variants. The compounds of 
Type IV wherein polyethylene glycol is attached to a thio 
group are referred to as cysteine-PEGylation proteins, 
and the compounds of Type V are referred to as fusion 
proteins or chimeric proteins. 
20 Embodiments of the present invention provide 

pharmaceutical compositions which contain one or more 
variants of all or any of Type I, II, m, iv or V. 

An important object of the invention is to provide 
a wide range of different variants, and in particular 
25 PN-l-variants, having a particular and desired biological 
activity. 

Another object is to provide PN-l variants using 
site-directed mutagenesis in order to change a single 
amino acid within the active site of PN-l. 
30 Another important object is to provide a PN-i- 

variant wherein the active site of PN-l is specifically 
modified so as to match the active site of another enzyme 
inhibitor, preferably another serpin. 

Yet another important object of the present 
35 invention is to provide variants such as protease nexin-l 



WO 95/11987 



PCT/US94/11624 



- 5 - 



variants which include, in PN-l, the substrate sequence 
for a different protease, making it possible to inhibit 
the activity of that protease. 

Still another important object is to provide 
proteins which are PEGylated by attachment to a thio 
group, i.e. the polyethylene glycol is attached to a 
cysteine amino acid within a protein, which cysteine 
amino acid of the protein is not involved in a disulfide 



bond. 

10 



Another important object is to provide a method of 
attaching polyethylene glycol to a protein by first 
subjecting the protein to site-directed mutagenesis to 
add a cysteine residue at a position where the protein or 
a structurally related protein is normally glycosylated, 
15 and thereafter attaching the polyethylene glycol to the' 
cysteine residue. 

Another important object is to provide a method of 
attaching PEG to a protein by first subjecting the 
protein to site-directed mutagenesis to add a cysteine 
residue at a position on the surface of the protein, and 
thereafter attaching the PEG to the cysteine residue. 

Another important object is to provide dimeric or 
multimeric proteins cross-linked by reaction with a 
reagent composed of PEG having two protein-reactive 



20 



30 



35 



Yet another important object is to provide fusion 
proteins wherein the receptor binding region of another 
protein is connected to PN-l in order to localize pm-1 to 
a different receptor. 

Another object of the present invention is to 
provide a pharmaceutical composition comprising excipient 
carrier materials having a compound of the invention 
dispersed therein. 

Another object of the present invention is to 
provide therapeutic methods of treatment which involve 
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administering to a patient in need thereof a 
pharmaceutically effective amount of a composition 
comprising excipients and a compound of the invention. 
A feature of the present invention is that the 
5 variants can be designed to have a specific receptor- 
binding domain while maintaining the natural biological 
activity of the protein to which the new binding domain 
is attached. 

An advantage of the present invention is that the 
10 variants have substantially different inhibitory effects 
on certain proteolytic enzymes as compared to the natural 
protein. 

Another object of the present invention is to 
provide variants useful in treating diseases associated 
15 with a specific biological activity. 

Yet another advantage of the present invention is 
to describe and disclose variants which are useful in 
treating elastase-related diseases. 

Another feature of the present invention is that 
20 certain variants have substantially altered protease 
specificity as compared with the natural protein. 

Another advantage of the present invention is that 
certain variants have substantially greater second order 
association rate constants with respect to particular 
25 serine proteases as compared with the second order 
association rate constant of natural proteins with 
respect to such serine proteases. 

Another advantage of the present invention is that 
certain variants have substantially slower second order 
30 association rate constants with respect to particular 
serine proteases as compared with the second order 
association rate constant of natural proteins with 
respect to such serine proteases. 

Yet another object is to provide methods of 
35 delivery such as by injection, internasal and 
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interpulmqnary delivery which methods are carried out 
using pharmaceutical compositions in the form of 
injectable formulations, spray formulations and aerosols 
Another advantage is that biologically stabilized 
5 proteins can be produced by attaching the polyethylene 
glycol to a cysteine residue of the protein. 

Another advantage is to provide methodology for 
readily attaching polyethylene glycol molecules to 
proteins at a cysteine residue of the protein which are 
0 preferably located at native sites of glycosylate. 

Yet another advantage is that amino acid residues 
for substitution with cysteine may be selected so that 
subsequent attachment of polyethylene glycol to the thio 
group of the substituted cysteine residue increases 
> biological stability of the cysteine-PEGylated protein 
relative to wild type without abolishing biological 
activity. 

Another advantage is that proteins which normally 
require glycosylate for biological stability may be 
produced commercially by expression in a prokaryotic host 
or other host which does not provide for glycosylated 
recombinant proteins. After expression of the protein by 
the prokaryotic host, biological stability of the protein 
can be increased by attachment of polyethylene glycol to 
a native or engineered cysteine residue in the protein. 

Another advantage is that cysteine-PEGylated 
proteins can be produced without exposing the protein to 
highly toxic chemicals such as dioxane, cyanuric 
chloride, DMF, or other chemicals used in conventional 
methods for attaching polyethylene glycol to a protein. 

These and other objects, advantages and features 
of the present invention will become apparent to those 
persons skilled in the art upon reading the details of 
the structure, synthesis, formulation and usage as more 
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fully set forth below reference being made to the 
accompany figures forming a part hereof. 

3rief Description of the m-aw^ t* 

Figure 1 shows the nucleotide sequence of the 
5 coding region and the deduced amino acid sequence of 
PN-io; and 

Figure 2 shows the nucleotide sequence of the 
coding region and the deduced amino acid sequence of 
PN-l/3. 

10 Figure 3 is a schematic drawing of a 

three-dimensional structure of PN-i as determined by x- 
ray crystallography. The approximate position of 
residues of particular interest are shown according to 
their relative position within a given helix (h) or /?- 
15 sheet (s) . The helices and 0-sheets of the pn-1 protein 
are each assigned letters (e.g. A, B, etc.) ( Eng h, et al. 
1990 Protein Engin. 3 (6) :469-477) . 

Figure 4 is a graph which shows the activity of 
samples of reaction mixtures containing cysteine- 
20 PEGylated PN-1 variants (N99C;N140C) produced by the 
method of the invention (open squares) and the activity 
of samples of reaction mixtures containing a PEGylated 
PN-1 variant (N99C;N140C) produced by a conventional 
method (closed diamonds) . 

25 Polled Dpscription of the Pr P fpr red RmhoH^^- 
Before the present compounds, variants, 
formulations and methods for making and using such are 
described, it is to be understood that this invention is 
not limited to the particular compounds, variants, 

30 formulations or methods described, as such variants 

formulations and methodologies may, of course, vary' it 
is to be understood that the terminology used herein is 
for the purpose of describing particular embodiments 
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only and is not intended to be limiting since the scope 
of the present invention will be limited only by the 
appended claims. 

It must be noted that as used in the specification 
5 and the appended claims, the singular forms "a*, ' an « and 
"the- i nclude plural referente unless ^ cQnteLt an ana 

dictates otherwise. Thus, for example, reference to "a 
protease nexin-l variant" includes mixtures of such 

10 IT1T 3 ' T ferenCe ^ anal ° gW inClUdeS ««« to 
10 mixtures of such analogs and reference to "the method of 

treatment" includes reference to one or more methods of 
treatment of the type which will be known to those 
skilled in the art or will become known to them upon 
reading this specification, and so forth. 

15 A - Definitiv e 

As used herein, "protease nexin-l" and " PN - 1M are 
used interchangeably and refer to DNA codons and 

pHri" 9 r ino acid sequences which ° ake up — 

20 Z , » ^ ShOWn respectivel y ^ Figures i and 2. 

PN-l xs distinguishable from the two other protease nexin 
factors, pn-ii and pn-iii (Knauer, D .J. et al., ^ 
^ (1982, 252,1509.-15104,, which are also thro^T 
inhibitors, but are l ess strongly binding to this 
protease and are of different molecular weight, three- 
25 dimensional structure and mechanism of function 

The terms "variants", "protein variants" and 
chimeric proteins" are used interchangeably herein to 
refer to any amino acid sequence which corresponds to the 
ammo acid sequence of a natural protein or a 
30 biologically active portion of a nat ural protein except 
that some change has been made in the structures 
Typical changes per the present invention involve- 
(1) one or more amino acids within the natural sequence 
is replaced with one or more amino acids different from 
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the amino acids present in the natural protein; and/or 
(2) one or more amino acids has been added to the natural 
sequence, and the addition of such amino acids changes 
the biological activity of the variant; and/or (3) one or 
5 more amino acids is deleted from the natural sequence- 
and/or (4) polyethylene glycol is bound to a thio group 
of a natural or artificially introduced cysteine residue 
of a sequence; and/or (5) two naturally occurring 
sequences are fused together, i.e. two sequences not 
10 normally connected are fused. 

"Protease nexin-1 variants" and "analogs of 
protease nexin-1" are terms which are used synonymously 
herein to define a Type I variant and are thereby 
encompassed by the term "variant." The terms are 
L5 intended to refer generally to proteins wherein one or 
more of the amino acids within protease nexin-1 have been 
substituted with a different amino acid. More 
specifically, the protease nexin-l variants of the 
invention include substantially the same amino acid 
:o sequence as protease nexin-1 but for the substitution of 
different amino acids at or near the active site 
Specifically, substitutions of different amino acids can 
be made at any of P„ p 2 , Pjf P< sites and/Qr fflade at ^ 

V, P 2 ', or P 3 », p 4 . S it es . Although other substitutions 
5 and deletions of amino acids in the sequence of protease 
nexin-1 are encompassed by this invention, the 
substitutions at or near the active site are most 
important with respect to changing the specificity and/or 
reactivity of the variant with respect to particular 
» proteases. Particularly preferred protease nexin-l 
variants of the invention are variants which have high 
activity relative to a substrate to which natural pn-i 
has little or no activity such as variants which inhibit 
elastase and, more particularly, which inhibit elastase 
and have their ability to inhibit elastase enhanced in 
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the presence of heparin and/or heparin-li ke compounds. 
Other preferred protease nexin-i variants, for example 
have increased ability to inhibit urokinase and/or 
another serine protease as compared with protease 
5 nexin-i. 

"Control sequence" refers to a DNA sequence or 
sequences which are capable, when properly li gat ed to a 
desired coding sequence, of effecting its expression in 
hosts compatible with such sequences, such control 

10 sequences include at least promoters in both procaryotic 
and eucaryotic hosts, and preferably, transcription 
termination signals. Additional factors necessary or 
helpful in effecting expression may also be identified 
As used herein, -control sequences" simply refers to 

15 whatever DNA sequence may be required to effect 
expression in the particular host used. 

"Cells" or "cell cultures" or "recombinant host 

will I"/' 1108 " CGllSW ^ ° ften US6d interchangeably as 
will be clear from the context. These terms include the 

20 immediate subject cell, and, of course, the progeny 
thereof. it is understood that not all progeny are 
exactly identical to the parental cell, due to chance 
mutations or differences i„ environment. However, such 
altered progeny are included in these terms, so long as 
25 the progeny retain the characteristics relevant to those 
conferred on the originally transformed cell, m the 
present case, for example, such a characteristic might be 
the ability to produce recombinant PN-i 

"Purified" or "pure" refers to material which is 
30 free from substances which normally accompany it as found 
in its native state. Thus "pure" PN-l-encoding DNA 
refers to DNA which is found in isolation from its native 
environment and free of association with DNAs encoding 
other proteins normally produced by cells natively 
35 producing PN-l. "Pu re " pn-x refers to ^ whicn 
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not contain materials normally associated with its in 
situ environment in human or other mammalian tissue?" of 
course, "pure- pn-1 may include materials in covalent 
association with it, such as glycoside residues or 
5 materials introduced for, for example, formulation as a 
therapeutic. The term "pure" also includes variants 
wherein compounds such as polyethylene glycol, Biotin or 
other moieties are bound to the variant in order to allow 
for the attachment of other compounds and/or provide for 
10 formulations useful in therapeutic treatment or 
diagnostic procedures. "Pure" simply designates a 
situation wherein the substance referred to is, or has 
been, isolated from its native environment and materials 
which normally accompany it. Of course, the DNA claimed 
15 herein as purified and free of substances normally 
accompanying it, but encoding PN-l, can include 
additional sequence at the 5' and/or 3- end of the coding 
sequence which might result, for example, from reverse 
transcription of the noncoding portions of the message 
20 when the DNA is derived from a cDNA library or might 

include the reverse transcript for the signal sequence as 
well as the mature protein encoding sequence. 

"Degenerate with", as referred to a DNA sequence, 
refers to nucleotide sequences encoding the same amino 
25 acid sequence as that referenced. 

"Operably linked" refers to a juxtaposition 
wherein the components are configured so as to perform a 
desired function such as their natural biochemical 
function. Thus, control sequences or promoters operably 
30 linked to a coding sequence are capable of effecting the 
expression of the coding sequence. 

"Heparin", "heparan sulfate" and "heparin- like 
compounds" are terms which are used synonymously herein. 
Each of the terms singly or in combination with the 
35 others is intended to encompass a large group of 



WO 95/11987 



PCT/US94/11624 



- 13 - 



compounds which are generally described as sulfated 
polysaccharides, which includes proteoglycans and 
glycosaminoglycans (GAG) which are alternating copolymers 
of a hexosamine and an aldouronic acid. These copolymers 
5 are found in sulfated forms and are synthesized as 
proteoglycans and are collectively referred to as 
mucopolysaccharides. Other compounds such as dextran 
sulfate are considered "heparin- like- for purposes of the 
invention. Similar alternating copolymers, especially 
10 those which are highly sulfated and thus very negatively 
charged, are useful "heparin-like" compounds in this 
invention. Extensive information with respect to 
"heparin", "heparin-like compounds" such as 
glycosaminoglycans are described in detail by Benito 
15 Casu, -structure and Biological Activity of Heparin", 
published in Advances in Carbohydrate Chemistry and ' 
Biochemistry, Vol. 43 , pp. 51-134, which is incorporated 
herein by reference to disclose such compounds which 
Bight be useful in combination with certain PN-i variants 
20 disclosed herein. 

A description of the invention is facilitated by 
listing the relationship between the one-letter symbols 
and the three-letter abbreviations for amino acids as 
follows: 
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Ope-Letter Symbols Three-LettPr 
Abbrevia t i ons 



A alanine 
C cysteine 
5 D aspartic acid 



H histidine 

10 I isoleucine 

K lysine 

L leucine 

M methionine 

N asparagine 

15 P proline 

Q glutamine 

R arginine 

S serine 

T threonine 

20 V valine 

W tryptophan 

Y tyrosine 



ala 
cys 



asp 

E glutamic acid gln 
F phenylalanine phe 
G glycine gly 



his 

ile 

lys 

leu 

met 

asn 

pro 

gin 

arg 

ser 

thr 

val 

trp 

tyr 



Amino acids have the following general structural 

formula 



25 NH 3+ 



-OOC-C-R 
I 

H 
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X III Cl ^ Siti ' a "~ a °» «» chemical composition of 

the "R" group as follows: 



1 • Aliphatic 

2 . Hydroxy 1 

3 . Sulfur 

4 . Aromatic 

5. Acidic (and amides) 

6. Basic 

7 . Imino 



10 d ... N t, tUrally amino acids can be generally 

classified as being polar or non-polar as follows: 

Polar S , T, C, Y, D , N, E, Q, R> H , K 

Non-polar G , A , V, L, I, M, F, W, p 

15 ^ 9r ° UP WhiCh deteraines whether the amino 

15 acid will be polar or non-polar. 

into An,in ° aCi<J reSidU6S Cai! ^ 9enerall y -^classified 
into four major subclasses as follows: 

Acidic: The residue has a negative charge due to 
loss of H xon at physiological p H and the residue is 
20 attracted by aqueous solution so as to seek the surface 
positions in the conformation of a peptide in which it is 
contained when the peptide is in aqueous medium at 
physiological pH. 

Basic: The residue has a positive charge due to 
25 association with H ion at physiological pH and the 

residue is attracted by aqueous solution so as to seek 
the surface positions in the conformation of a peptide in 
which it is contained when the peptide is in aqueous 
medium at physiological pH. 
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Neutral/non-polar: Th residues are not charged 
at physiological pH and the residue is repelled by 
aqueous solution so as to seek the inner positions in the 
conformation of a peptide in which it is contained when 
5 the peptide is in aqueous medium. These residues are 
also designated "hydrophobic" herein. 

Neutral/polar: The residues are not charged at 
physiological p H , but the residue is attracted by aqueous 
solution so as to seek the outer positions in the 
10 conformation of a peptide in which it is contained when 
the peptide is in aqueous medium. 

It is understood, of course, that in a statistical 
collection of individual residue molecules some molecules 
will be charged, and some not, and there will be an 
15 attraction for or repulsion from an aqueous medium to a 
greater or lesser extent. To fit the definition of 
"charged", a significant percentage (at least 
approximately 25%) of the individual molecules are 
charged at physiological pH. The degree of attraction or 
20 repulsion required for classification as polar or non- 
polar is arbitrary, and, therefore, amino acids 
specifically contemplated by the invention have been 
specifically classified as one or the other. Most amino 
acids not specifically named can be classified on the 
25 basis of known behavior. 

Amino acid residues can be further subclassif ied 
as cyclic or noncyclic, and aromatic or nonaromatic, 
self-explanatory classifications with respect to the side 
chain substituent groups of the residues, and as small or 
30 large. The residue is considered small if it contains a 
total of 4 carbon atoms or less, inclusive of the 
carboxyl carbon. Small residues are, of course, always 
nonaromatic. 
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For the naturally occurring pr tein amino acids 
subclassif ication according to the foregoing scheme is as 
follows: 



Acinic: Aspartic acid and Glutamic acid; 
fiasic/noncycljn: Arginine, Lysine; 
Basic/cveUr-. Histidine; 

Neutral /polar/small; Threonine, Serine and 
Cysteine; 

Neutral/polar/larag/ n onaromai-^ . Threonine, 
Asparagine, Glutamine; 

Neutra 1 /polar / 1 ar^ / aroma t i n. . Tyrosine; 

Neutral /non-Dol a r/ C n. a i i . Alanine; 

HeutpaVpon-poiarnarae/nonarn^f^ . Valine, 
Isoleucine, Leucine, Methionine; 

Neutral /non-polar /I arae/amm**-^ . Phenylalanine 
and Tryptophan. 



Proline 

The gene-encoded amino acid proline, although 
technically within the group neutral/non-polar/ large/ 
20 cyclic and nonaromatic, is a special case due to its 
known effects on the secondary conformation of peptide 
chains, and is not, therefore, included in this defined 
group, but is included as a group of its own. 

Other amino acid substitutions for those encoded 
25 in the gene can also be included in peptide compounds 
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within the scope of the invention and can be classified 
within this general scheme. 

Variants of the invention may include commonly 
encountered amino acids, which are not encoded by the 
5 genetic code, for example, 0-alanine (0-ala) , or other 
omega-amino acids, such as 3-amino propionic, 4-amino 
butyric and so forth, ct-aminoisobutyric acid (Aib) , 
sarcosine (Sar) , ornithine (Orn) , citrulline (Cit) , 
t-butylalanine (t-BuA) , t-butylglycine (t-BuG) , 
10 N-methylisoleucine (N-Melle) , phenylglycine (Phg) , and 
cyclohexylalanine (Cha) , norleucine (Nle) , cysteic acid 
(Cya) and methionine sulfoxide (MSO) . These also fall 
conveniently into particular categories. 

Based on the above definition, 
15 Sar and 0-ala are neutral/non-polar/ small; 

t-BuA, t-BuG, N-Melle, Nle and Cha are neutral/ 
non-polar/ large/nonaromat ic ; 

Orn is basic/noncyclic; 

Cya is acidic; 
20 cit, Acetyl Lys, and MSO are neutral/polar/ 

large/nonaromatic; and 

Phg is neutral/non-polar/ large/aromatic. 
Both L and D isomers of amino acids encoded by the 
genetic code or otherwise are included as amino acids 
25 useful in this invention provided the resulting protein 
processes the required activity. 

The various omega-amino acids are classified 
according to size as neutral/non-polar/ small (jff-ala, 
i.e., 3-aminopropionic, 4-aminobutyric) or large (all 
30 others) . 

The nomenclature used to describe compounds of the 
present invention follows the conventional practice 
wherein the amino group is assumed to the left and the 
carboxyl group to the right of each amino acid in the 
35 peptide. In the formulas representing selected specific 
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embodiments of the present invention, the amino- and 
carboxyl-terminal groups, although often not specifically 
shown, will be understood to be in the form they would 
assume at physiological pH values, unless otherwise 
specified. Thus, the N-terminal H + 2 and C-termirtal-O" at 
physiological pH are understood to be present though not 
necessarily specified and shown, either in specific 
examples or in generic formulas. 

Serine Proteases and their TnMK jtors f .g aTT ^ o) 
Although originally named for their mechanism of 
action, members of the serine protease family also show 
significant sequence and structural homology. Some 
serine proteases are very specific, cleaving only certain 
peptide bonds of a specific target protein while others 
15 are very nonspecific, degrading multiple target proteins 
into small peptides. 

Serine proteases are regulated at many levels. 
Some are synthesized as inactive proenzymes and are 
activated only during specific events and at specific 
20 locations. This allows the body to respond rapidly to a 
physiological perturbation by activating an already 
present reservoir of proteolytic activity. Coagulation, 
for example, is carried out when circulating proenzymes' 
such as Factor X and prothrombin are sequentially 
25 activated in response to injury resulting in a cascade of 
clotting activity. In addition, proteolytic activity is 
often localized to specific sites, such as receptor 
binding sites which can cause high local concentrations 
of protease or proenzyme ready for activation. 
JO once activated, it is extremely important that 

proteolytic activity be confined both spatially and 
temporally. This control is often achieved by the 
presence of specific inhibitors which block proteolytic 
activity. An important family of related proteins, the 
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serine protease inhibitors, or "serpins", are key in the 
regulation of serine proteases. Like the serine 
proteases, serpins were first defined by their common 
mechanism of action but later turned out to be highly 
homologous both in terms of sequence and structure. 

Serpins all contain an inhibitor domain with a 
reactive peptide bond defined on either side by the 
variables P, and P,'. m a direction to the left away 
from the reactive site, the amino acids are referred to 
as *V P 3 » etc., and in a direction to the right away 
from the reactive site they are referred to as Pj • , p 2 ., 
P 3 '/ etc. The P, residue is recognized by the substrate 
binding pocket of the target protease which attacks the 
reactive peptide bond as though a normal substrate. 
15 However, hydrolysis of the peptide bond and release of 
the protease does not proceed to completion. The normal 
deacylation step is so slow that the reaction becomes 
essentially irreversible and the protease becomes trapped 
in a stable, equimolar complex. 
20 Protease nexin-1 (PN-1) is a member of the serpin 

family. PN -l is produced by many different cell types 
including fibroblasts, glial cells, platelets and 
microphages. PN-l is secreted by cells into the 
extracellular environment where it binds to and inhibits 
25 target serine proteases. PN-l-protease complexes then 
bind back to specific cell surface receptors where they 
are internalized and degraded. 

PN-l is very similar, both structurally and 
functionally to antithrombin (AT-III) . AT-Ili is the 
30 primary plasma inhibitor of blood coagulation. The 
inhibition of thrombin by AT-III in plasma is normally 
very weak but is accelerated significantly by the 
presence of heparin or by other mucopolysaccharides on 
the endothelial lining of blood vessels. The therapeutic 
value of heparin as a blood "thinning- agent is due to 
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its enhancement of AT-Ili activity. Like AT-III, PN-1 
has a high affinity heparin binding site and inhibits 
thrombin much more rapidly (50-100 fold) in the presence 
of heparin. Thus PN-l has therapeutic potential as an 
5 anticoagulant. 

On the other hand, PN-l differs from AT-IH in a 
number of ways. Unlike AT-m, PN-l is also a good 
inhibitor of the fibrinolytic enzymes urokinase and 
plasmin, as well as trypsin. Furthermore, PN -l is not 
10 found in significant quantities in plasma and may 

function primarily in the tissues. The high affinity 
heparin binding site of pn-1 serves to localize it to 
connective tissues and cells which contain sulfated 
proteoglycans on their surface and surrounding 
15 extracellular matrix. Thus PN-l's primary role seems to 
be in regulating proteolytic activity i„ tissues as 
opposed to blood. Further evidence for the role of pn-i 
as found by the fact that it is present in brain tissue 
and may be involved in peripheral nerve regeneration and 
20 neurite extension. 

The relative efficiency with which PN-l inhibits 
serine proteases can be measured by the second order 
association rate constant (k^) as previously described in 
Bieth, J.G. (Bull. Euro. Phv^np^h P ^ r (1980) 
25 16:183-195), and reported by Scott et al. ( J . Biol, ch^ 
(1985) 26^:7029-7034), both of which are incorporated 
herein by reference to disclose and explain the meaning 
of the rate association constant. i„ general, a value 
for k w equal to or greater than 1 x lo 5 it's' 1 for a 
particular protease-inhibitor reaction is considered to 
be physiologically significant ( Travis ,nH c a lvAgf>n R _ 
Rev. Biochem. (1983) 52:655-709). The k M or rate 
association constant has inverse-mole-seconds as its 
units, and the larger the k„„ the more rapid the 
35 inhibition. Accordingly, a k^ value is always given as a 
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value with respect to a particular enzym and is zero if 
there is no inhibition of the enzyme. 

Many physiologically important protease inhibitor 
reactions such as elastase-a-l antitrypsin and plasmin- 
5 o-2-antiplasmin occur with rate constants as high as 1 x 
10 7 M-'S' or greater. The thrombin-PN-l reaction occurs at 
a similar high rate in the presence of heparin. 

Descripl-ir^ of p W _-, fa anrt ^ 

Figures 1 and 2, respectively, show the amino acid 
10 sequence of PN-la and PN-lp\ The « and 0 forms differ by 
the substitution of thr, I0 -gly s|I in PN , 1/S for ^ 
PN-la. Alignment of the reactive site center of PN-l 
with other serpins, such as antithrombin III, predicts 
that arginine 345 (346 for PN-10) is the reactive site 
15 center or -p,- site. The -p,- site (arginine at position 
345 for PN-la and 346 for PN-l/?) has been confirmed by 
sequencing of the peptide fragment released from PN-l 
upon dissociation of complexes with thrombin. 
Furthermore, PN-l normally inhibits only enzymes which 
20 cleave at arginine (the P, residue), such as thrombin, 
plasmin, trypsin, plasminogen activators, and plasma 
kallikrein. 

Based on the above and by referring to the 
sequences of PN-la and PN- 10 shown in Figures i and 2 
25 respectively, it can be seen that the -P.-site is serine 
at position 346 for PN-la and serine at position 347 for 
PN-10. 



30 



Description of Prot pase TnMMtor AnM„„ 

In order to allow the body to respond rapidly 
several serine proteases are synthesized at relatively 
high levels in their inactive proenzyme forms and are 
only activated during specific events. For example, 
coagulation is carried out when circulating proenzymes 
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such as Factor X and prothrombin are sequentially 
activated in response to an injury. This activation 
results in a cascade of clotting activity. Proteolytic 
activity is often localized to specific sites such as 
5 receptor binding sites. Once a proteolytic enzyme is 
activated, it is extremely important that the enzyme 
activity be confined both spatially and temporally. Such 
confinement is in part brought about by the inhibitory 
effect of serpins. 

10 An serpins contain an inhibitor domain with a 

reactive peptide bond defined on either side by P, and P,» 
residues. The P, residue (such as arginine at position ' 
345 for PN-lor and 346 for PN-10) is recognized by the 
substrate binding pocket of the target protease. Upon 
15 recognition of the -reactive', site (of the inhibitor by 
the protease) the protease attacks the reactive peptide 
bond of the inhibitor as if it were a normal substrate 
However, in the case of serpin hydrolysis of the peptide 
bond and release of the protease does not proceed to 
20 completion. The normal deacylation step is so slow that 
the reaction becomes essentially irreversible and the 
protease becomes trapped with the inhibitors in a stable, 
covalent, equal molar complex, since the P, residue is 
the predominant determinant residue recognized by the 
25 substrate binding pocket of the target protease, 
, alteration of this residue can alter the protease 
specificity of the inhibitor entirely or substantially 
change the degree of the inhibitory effect obtainable. 
Residues near the P, residue (i.e., p 4 -p<.) also 
«0 contribute to protease specificity. Accordingly, 

alteration of these residues can also lead to modified 
inhibitory effects. 
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Variants in General 

Amino acid sequences, active sites and biochemical 
activity of a number of natural proteins and in 
particular natural serpins are known, it is also known 
5 that some proteins have a high degree of activity with 
respect to a certain protease, whereas another protein 
will have virtually no activity with respect to that 
protease. The present inventors noted this information 
and deduced that it might be possible to change the 
10 protease inhibitory activity of a particular protein, in 
a directed manner, by replacing its active site with the 
active site of another protein having a completely 
different activity with respect to that protease. 
Alternatively, one can create a variant of the invention 
15 by attaching to a first protein, the receptor binding 
region of a second protein, in that the biochemical ' 
function and protein binding specificity of different 
proteins are known, the methodology of the present 
invention makes it possible to make certain logical 
20 deductions and create a specific variant with a 

relatively high expectation of not only changing the 
activity (e.g. binding affinity) of the protein, but 
changing it in a specific and directed manner. 

An example of producing chimeras (Type V variants) 
25 of the invention can be carried out as follows. A first 
protein might be known to have virtually no binding 
affinity with respect to a given receptor and a second 
protein might have very high binding affinity with 
respect to that receptor. (A receptor can be any protein 
30 or a portion thereof, ligand, cell surface area or 

molecule.) By attaching the binding region of the second 
protein to the first protein one can provide a variant 
with the biochemical functions of the first protein which 
will bind to the receptor previously bound only by the 
35 second protein. A specific example of utilizing the 
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methodology of th present invention in order to taxe 
advantage of the binding specificity and biochemical 
activity of two different proteins is described beXow 
A specific situation to which variants of the ' 

of .onocytes and neutrophils from the bloodstream into 
inflated t 1S sue retires activation events at the site of 
interaction. These events include expression Tt 
molecules and recruitment of transmigLion Lnin^ 
To migrate through the endothelium, the connection^ 
between the cells must be degraded, including the 
basement membrane and extraceHular matrix. It is now 

ST**- th " -Mnase-type plasminogln 
activator (upa) -plasmin system plays a major role in 
» regulating extracellular proteolysis i . e . uPA^lLin is 
portent in breaxin, intracellular connections so tha t 

ZU l 9 ^rounding tissues. Although the 

) en!ot h T °* cells through the 

endothelxum to the surrounding tissue is desirable too 
-* migration too ouic*ly results in undesirable 

inflammation of the tissue and, possibly, blocxage of the 
bloodstream passing through the inflamed tissues » 
vacant which would (1) bind to activated endothelium 
cells and ( 2 , inhibit uPA-plasmin would be useful JT 
preventing and/or reducing inflammation at a particular 

T ™°r =ell invasiveness is also dependent upon the 
uPA-plasmxn system as shown in tumor cell metastasis 
model systems (Ossowsxi and Reich, 19,3 ; Hearing et .1 
»..) and extracellular matrix degradation and o.sement 
membrane invasion (Cergman, et al., 1986; „i„n a tti T 

levels of upa are significantly higher in human breast 
cane r tissues than in normal tissues. Increased amounts 
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of uPAR correlate with increased invasiveness of 
malignant cells in model systems (Ossowski, 1988; 
Hearing, et al. # 1988). Agents which block the ' 
proteolytic activity of uPA or plasmin may protect 
5 against extracellular matrix degradation and basement 
membrane invasion and aid in preventing inflammation. 
Similarly, agents which block the interaction of 
urokinase with the urokinase receptor (uPAR) might block 
metastasis . 

10 A variant protein of the present invention could 

be designed to block the urokinase receptor and inhibit 
urokinase. Specifically, a variant could be produced 
which combined the ability to block the urokinase 
receptor with the ability to inhibit urokinase and 
plasmin, and thereby have an effect on alleviating or 
preventing inflammation. Such a variant would be more 
effective in reducing or preventing inflammation than 
would either protein by itself. By making use of the 
urokinase receptor binding ability of the variant the 
variant will localize to the desired specific site of 
extracellular matrix degradation, specifically preventing 
further degradation by inhibiting enzymes and preventing 
the bmdxng of enzymes which cause degradation, thereby 
having a dual effect on alleviating or preventing 
25 inflammation. 

To produce a chimeric protein of the present 
invention the central role of uPA and uPAR in cancer 
invasion and inflammation were recognized, with this 
information in mind it was understood that the present 
invention should provide a chimeric protein which would 
interfere with the binding of uPA to uPAR and inhibit 
both uPA and plasmin generated at sites of cellular 
invasion. 

The receptor binding region of uPA has been 
localized to the 135 residue amino-terminal fragment 
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(ATF) . This region (i.e. ATF) binds to the uPAR with 
high affinity (K, o.i to 1 nM) and can competitively 
inhibit the binding of uPA to the uPAR. 

PN-l blocks tumor cell-mediated extracellular 
5 matrix degradation and tumor cell migration in v^o, in 
that PN-l inhibits the »PA-plasmin system, m ordtr" to 
generate a more effective and specific inhibitor of tumor 
metastasis and leukocyte invasion, the present invention 
provides a chimeric protein consisting of the amino- 
10 terminal fragment of uPA and PN-l (ATF-PN1, . Due to the 
presence of the ATF portion, this chimeric protein has a 
high affinity to sites of cellular invasion; and due to 
the PN-l protein it inhibits upa and plasmin. 

Type T Variant- 
15 This aspect of the invention involves the 

manipulation of the amino acid sequence of the PN-l so 
that the reactive site is in some way altered, to change 
the protease specificity or the degree of inhibitory 

20 r'V' PN_1 °" S6rine Pr ° teaSeS - *«• specially, the 
20 present mvention involves substituting one or more amino 
acids within protease nexin-1 and/or deleting or adding 
amino acids to the sequence of protease nexin-1 in order 
to obtain an effect on the reactive site of protease 
nexin-1 so that the protease specificity of protease 
25 nexin-l and/or the degree of inhibitory effect of 

protease nexin-1 on a serine protease is changed, m 
general, the change in protease specificity or degree of 
inhibitory effect is obtained by substituting an amino 
acid at the P,, P 2 , p 3 , p< or/ alternatively, P,', p.. p . 
30 P 4 . sites, still more specifically, the invention ' ' ' 
involves substituting one or both of the "P," site 
arginine residue or - P , . « s i te ser ine residue with a 
different residue resulting in PN-l variants with 
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radically different protease specificities and/or 
inhibitory effects on particular serine proteases. 

The pn-1 variants of the invention can also be 
described in terms of their functionality. Importantly 
5 some of the PN-1 variants of the invention are capable of 
inhibiting elastase. Within this general group are PN-i 
variants wherein the ability to inhibit elastase is 
greatly enhanced in the presence of heparin and/or 
hepar in-like compounds. Another group of pn-1 variants 
10 of the invention include PN-l variants which have an 

enhanced ability to inhibit serine proteases as compared 
with PN-l. The PN-l variants of the invention are 
designed to inhibit serine proteases such as urokinase, 
Factor Xa plasmin, kallikreih, Factor Xlla, Factor XIa' 
15 Factor Va, tPA, elastase, cathepsin and contrapsin. 
Functional objectives of the invention, such as the 
production of a compounds which inhibit serine proteases 
such as elastase and whose ability to inhibit such is 
enhanced in the presence of other compounds such as 
20 heparin, are obtained, in general, by manipulating DNA 
Specifically, the DNA encoding an enzyme is manipulated 
by including within the DNA a sequence of DNA which 
encodes a substrate for a particular serine protease. 
The recombinant DNA is then expressed to produce a 
25 variant of the invention which will include a substrate 
for the particular serine protease. in that the 
substrate is present, the activity of that serine 
protease can be specifically inhibited by the variant 
while the variant maintains its natural biological 
30 activity. 

The five different types of variants of the 
invention will now be described in further detail. 
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?N-1 Variants — Active sil-g Mamnm**-^ 

For purposes of clarity, substitution at a single 
site will be discussed first (P, site then P, • site) 
followed by a discussion of multiple substitutions. 

Type I Variants; Single Site Mutation., 

The arginine residue is a polar, basic amino acid 
Substitution of the polar arginine with a non-polar 
residue has a dramatic effect on the degree of serine 
protease inhibition obtainable. As a specific example of 
this effect, reference is made to our compound NCY2010 
wherein arginine (at position 345 for o, 346 for 0) is 
substituted with isoleucine (R3451) . The R345I variant 
(i.e., isoleucine has been substituted for arginine a 
position 345 of PN-la) essentially eliminates thrombin 
L5 inhibitory activity with or without heparin. At the same 
txme, the NCY2010 variant is a good inhibitor of 
neutrophil elastase activity. This is surprising when it 
is noted that native fibroblast PN-l has no inhibitory 
effect on elastase, and in fact is a substrate for 
elastase. 

The activity of NCY2010 should be viewed with the 
understanding that the relative efficiency at which 
protease inhibitors (such as the PN-l variants of the 
invention) inhibit serine proteases are measured by a 
known standard. That standard is the second order 
association rate constant (k.j as described in (Bieth, 
J ' 6 *' Bull. Euro. Physinp^y T,^ r (1980) ifi:183 . 195) ' 
and reported by Scott et al. ( J. Biol. Ph»m (1985) 
260:7029-7034), both of which are incorporated herein by 
reference to disclose second order association rate 
constants . 

In general, a value for k„, equal to or greater 
than 1 x 10 5 M-'s' for a particular protease-inhibitor 
reaction is considered to be physiologically significant 
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(Travis and Salves on Ann. Rgy. Bioch»m (ig 8 3) 
52:655-709), incorporated herein by reference to describe 
the significance of rate constants. Many physiologically 
important protease-inhibitor reactions such as elastase- 
5 a-l-antitrypsin and plasmin-a-2-antiplasmin occur with 
rate constants as high as 1 x 10 7 M-'S" 1 or greater. The 
thrombin-PN-l reaction occurs at a similar high rate, or 
faster, in the presence of heparin. 

Native PN-l has essentially no effect with respect 
10 to inhibiting the activity of elastase. However, the 
R345I variants of the present invention clearly provide 
not just a new biological activity for the serpin, i.e., 
its ability to inhibit elastase, but clearly provide an' 
extremely potent elastase inhibitor. The ability of the 
15 R345I variants to inhibit elastase to such a degree was 
in itself a surprising finding. However, it was clearly 
unexpected to find that, in addition to providing such a 
potent elastase inhibitor, these variants had still 
further increased potency with respect to inhibiting 
elastase while in the presence of heparin. 

Variants of the invention are clearly capable not 
only of providing improved potency with respect to acting 
as elastase inhibitors, but of providing such activity 
site-specifically in that their activity is greatly 
enhanced in the presence of heparin, heparin-like 
compounds or other related mucopolysaccharides normally 
found in the endothelial lining of blood vessels, m 
addition to heparin, a range of sulfated proteoglycans 
such as other heparin-like compounds normally found on 
the surface and surrounding extracellular matrix would 
provide not only increased potency with respect to the 
ability of the variants of the invention to inhibit 
elastase but provide site-specific activity due to the 
affinity of these variants to heparin and heparin-like 
compounds . 
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The inhibit ry effect of the R345I variant on 
elastase is increased approximately two orders of 
magnitude in the presence of heparin, it Can be readily 
determined that -p,. variants with non-polar residues such 
5 as valine substituted for the polar arginine residue 

could be used as a heparin activatable inhibitor in order 
to treat individuals suffering from elastase-related 
diseases such as emphysema, congenital a-i-antitrypsin 
deficiency, inflammation and septic shock. Non-polar 
10 residues which can be used include G, A, V, L, I M P 
and W, and more preferably (due to »r» group structures 

,alme, g, «, v ana L, and most preferably I 

In a broad sense, the present invention 
encompasses PN-! variants which are capable of acting as 
15 potent elastase inhibitors. More specifically, 

invention encompasses such PN-l variants which act as 
elastase inhibitors and further wherein the ability to 
inhibit elastase is greatly increased in the presence of 
heparin and heparin-like materials, still more 
20 specifically, the invention encompasses P N -i variants 

their Ti"! 6 . laStaSe inhibit0 " and * h -« variants have 
their ability to inhibit elastase increased 10 fold or 

more m the presence of heparin, preferably 50 fold or 

»ore in the presence of heparin and more preferably loo 

5 fold or more in the presence of heparin. Useful 

formulations of the invention include PN-l variants 

formulated in pharmaceutical compositions along with 

heparin and heparin-like compounds such as various 

sulfated polysaccharides or proteoglycans, it is 

0 particularly preferred if the heparin-like compounds are 

highly sulfated, thus providing high negative charges 

Above it has been pointed out that P, variants of 

the invention which include non-polar residues such as 

valine substituted for the polar ar gi » ine can be used to 

> treat individuals due to the ability of these P, variants 
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to inhibit the activity of elastase. This is quite 
surprising since other proteases such as urokinase and 
plasmm which are inhibited by pn-1 are not heparin 
activatable. While not wishing to be bound by any 
5 particular theory, it fflay be that the P, variants of the 
present invention are effective in inhibiting elastase 
due to the cationicity of elastase which promotes its 
binding to heparin which is anionic. Accordingly in 
order to obtain other PN-l variants which are heparin 
10 actavatable inhibitors of elastase, the active site 

should be substituted with other residues which are non- 
polar and have similar -r- groups in order to have a 
reasonable expectation of similar activity. 

Novel A SpP Pfc of pw .i W91 .^^. 

15 The sequence of PN-ia and PN-l/S are given in 

Figures 1 and 2 respectively. Further, factors 
describing the characteristics of both have been put 
forth above. Prior to the present disclosure, variants 
of the invention such as elastase inhibitors of any pn-i 
20 were not known. Further, it was not known whether any 
such variants would provide any activity, let alone the 
type of activity obtained, m fact, elastase cleaves 
native P N -i. This cleavage inactivates PN-l toward 
thrombin and other proteases, it was quite possible 
25 that, in changing Arg^ to Ile(R345I), PN -l could be 
changed into an even better substrate for elastase, 
leading to even quicker inactivation of PN-l. The' 
present invention not only provides variants wherein 
active sites have been replaced, but shows that such 
3 0 variants have activity and that the activity is 

substantially different from the activity of the original 
PN-l. now that a number of variants and their activity 
have been shown, it can be seen that still other variants 
which might possess activity can also be produced, in 
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connection therewith, it is postulated that variants can 
be produced wherein substitution is made at both the P 
and P,' sites. Such double substitutions could be put' 
forth in a variety of different ways. 
5 One approach to producing such variants is to 

substitute one of the sites with a residue which is 
substantially different from the residue present such as 
including a non-polar residue in place of a polar residue 
while substituting the other site with a residue which is 
10 substantially similar to the residue present there both 
in terms of being polar or non-polar and in terms of 
having a similar »r* group. Another approach is to 
substitute both sites with residues which are 
substantially different from the original residues. y e t 
15 another possible means for producing variants would be to 
use either of the above-suggested strategies in 
combination with substituting other sites, a variety of 
such substitutions will occur to those skilled in the art 
upon reading this disclosure, what is important is that 
20 the resulting variant continued to provide activity. The 
ability of the variant to provide activity will depend on 
the substrate specificity. Accordingly, the present 
invention is intended to encompass single, double and 
multiple substitutions of the residues to provide 
25 variants which continue to have activity with respect to 
a given protease or gain substantial activity with 
respect to another protease. 

In connection with the present invention, the PN-1 
variants which have activity are variants which have 

(1) substantially increased potency with respect to 
inhibiting tPA, urokinase, and/or other related enzymes; 

(2) substantially increased potency with respect to 
inhibiting elastase; or most preferably (3) substantially 
increased potency with respect to inhibiting elastase and 
which potency is still further increased dramatically in 



30 
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the presence of heparin, m that the present invention 
has demonstrated that it is possible to produce pn-1 
variants which inhibit elastase and has further 

5 TSTT** lt iS P ° SSible t0 Produce «* variants 

5 which not only inhibit elastase, but have substantially 
increased potency to inhibit elastase in the presence of 

be P abT Skill6d ^ ^ ^ ° f SUCh ^^itors will 

witH \T ° ther VariantS WhiCh ™ inte "^ to be 

wathxn the scope of the present invention. 

10 Specific p N -! VaT .j a „«. g 

Examples of protease nexin-1 variants of the 
invention are listed in Table 2 along with -indication" 
of the variant. 
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Record t 

1 
2 
3 
4 
5 
6 
7 
8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 



TABLE 1 
HSX mutation sequence 

TTAILIAR — 



NCY2000 


CHO PN-1 


NCY2001 


PI Ala 




NCY2002 


PI A>-rr 


(WT) 


NCY2003 


PlAsn 


NCY2004 


PlAsn 




NCY2005 


Pi Puc 




NCY2006 


PlGln 




NCY2007 


PlGln 




NCY2008 


PI Ui c 








NCY201 0 


Fine 






PILeu 




NCY2012 


PILys 




NCY2013 


riKet 






PlPhe 




NCY2015 


PIPro 




NCY2016 


PISer 




NCY2017 


PIThr 




NCY2018 


PITrp 




NCY2019 


PITyr 




NCY2020 


PIVal 




NCY2021 


P2Ala 


(WT) 


NCY2022 


P2Arg 


NCY2023 


P2Asn 




NCY2024 


P2Asp 




NCY2025 


P2Cys 




NCY2026 


P2Gln 




NCY2027 


P2Glu 




NCY2028 


P2Gly 





indicating 
SSPP 



30 


NCY2029 


P2His 


31 


NCY2030 


P2Ile 


32 


NCY2031 


P2Leu 


33 


NCY2032 


P2Lys 


34 


NCY2033 


P2Met 


35 


NCY2034 


P2Phe 


36 


NCY2035 


P2Pro 


37 


NCY203 6 


P2Ser 


38 


NCY2037 


P2Thr 


39 


NCY2038 


P2Trp 


40 


NCY2039 


P2Tyr 


41 


NCY2040 


P2Val 


42 


NCY2041 


P3Ala 


43 


NCY2042 


P3Arg 


44 


NCY2043 


P3Asn 


45 


NCY2044 


P3Asp 


46 


NCY2045 


P3Cys 


47 


NCY2046 


P3Gln 



antielastase 
antielastase 
antiplasmin 



antielastase 



faster 
kinetics 



faster 
kinetics 
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10 



15 



20 



25 



30 



35 



Record i 


f NCY 


48 


NCY2047 


49 


NCY2048 


50 


NCY2049 


51 


NCY2050 


52 


NCY2051 


53 


NCY2052 


54 


NCY2053 


55 


NCY2054 


56 


NCY2055 


57 


NCY2056 


58 


NCY2057 


59 


NCY2058 


60 


NCY2059 


61 


NCY^n fin 


62 


NCY2101 


63 


NCY21-02 


64 


NCY2103 


65 


NCY2104 


66 


NCY2105 


67 


NCY2106 


68 


NCY2107 


69 


Mpvo i no 


70 


NCY2109 


71 


NCY2110 


72 


NCY2111 


73 


NCY2112 


74 


NCY2113 


75 


NCY2114 


76 


NCY2115 


77 


NCY2116 


78 


NCY2117 


79 


NCY2118 


80 


NCY2119 


81 


NCY2120 



- 36 - 

mutation s auence 



indicating 



P3Glu 

P3Gly 

P3His 

P3Ile 

P3Leu 

P3Lys 

P3Met 

P3Phe 

P3Pro 

P3Ser 

P3Thr 

P3Trp 

P3Tyr 

P3Val 

PI' Ala 

Pl'Arg 

Pl'Asn 

PI • Asp 

Pl'Cys 

Pl'Gln 

Pl'Glu 

Pl'Gly 

Pl'His 

PI • lie 

PI • Leu 
Pl'Lys 
PI' Met 
Pl'Phe 
PI 'Pro 
Pl'Ser 
Pl'Thr 

Pl'Trp 
Pi »Tyr 
Pl'Val 



(WT) 



(WT) 



anti- 
Pactor Xa 



anti- 
Factor Xa 



40 



45 



The above examples 1-81 represent the substitution 
at different sites within the active site of PN-l. For 
example, record nos. 2-21 represent a substitution at the 
P, position of PN-l. The indication »WT« is provided to 
indicate the naturally-occurring or wild-type sequence 
produce via CHO cells. As is evident from this 
disclosure, the examples could be continued to include 
all 20 amino acid substitutions at each position within 
the active site, that is, all 20 naturally-occurring 
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amino acids could be substitut d, using site-directed 
mutagenesis, at positions P„ Pj , P , f P< , Pj , , ^ ^ 

It is possible to produce individual type I 
5 variants using site-directed mutagenesis. However, it is 
also possible to produce large numbers of Type i variants 
at the same time. For example, it is possible to produce 
the 64 million different variants simultaneously wherein 
all of the 20 naturally occurring amino acids are 
10 substituted at all 8 positions. Such can be carried out 
using a phage display synthesis methodology as disclosed 
within U.s. patent 5,223,409 issued June 29, 1993 
Further, the chemical synthesis methodology disclosed 

15 I!! 1 ," U * S - Pat6nt 5 '°"'»* i—d April 23, 1991 can be 
15 used to produce such large mixtures of variants 

Different types of screening methodology such as that 
disclosed within U.S. patent 5,223,409 can then be used 
to screen the variants to determine particular 
activities . 

20 Testing Type T V^r^nf. 

Any of the variants of the present invention can 
be tested by comparing the variants with a panel of 
proteases and determining their second-order rate 
constants with respect to the different proteases, such 
25 tests have been carried out with an exemplary number of 
Type I variants, and the results are described below in 
Tables 1A through 11. 
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TABLE 1A 
Wild-type {C HO 
. NCY 2000 

5 Protease Second Order Rate 

rQl:eaSe - Constant fvrlrl) 

1) Thrombin 

2) Plasmin 

3) Plasmin (hp) 

4) Xa 
10 5) xa (hp) 

6) Urokinase 

7) Urokinase (hp) 

8) Kallikrein 

9) Cathepsin G 
15 10) Activated protein c 

11) Activated protein C (hp) 

12) Elastase 



5.98 


X 


10 5 


1.28 


X 


10 5 


4.51 


X 


10 5 


7.45 


X 


10 3 


4.85 


X 


10 4 


1.49 


X 


10 5 


3.30 


X 


10 5 


2.50 


X 


10 5 


<100 






1.42 


X 


10 4 


1.96 


X 


10 6 


<100 







(hp) indicates in the presence of io Mg/ml of heparin. 



TABLE IB 
(E. coli PN-j) 
NCY 2002 



Protease 

25 1) Thrombin 

2) Plasmin 

3) Plasmin (hp) 

4) Xa 

5) Xa (hp) 
30 6) Urokinase 

7) Urokinase (hp) 

8) Kallikrein 

9) Cathepsin G 

10) Activated protein C 
35 11) Activated protein C (hp) 
12) Elastase 



Second Order Rate 
Constant fM-'fi" 1 ) 

5.99 x 10 5 
3.44 X 10 5 
3.98 X 10 5 
4.82 x 10 3 
2.48 x 10 4 
2.41 x 10 5 
2.35 x 10 5 
6.30 x 10 4 
<100 

8.56 x 10 2 
2.00 x 10 5 
<100 



(hp) indicates in the presence of 10 tig /ml of heparin. 
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TABLE 1C 
PN-1 rP T fi1y) 
NCY 209ft 

5 Protease Second Order Rate 

Constant fM-'g-») 

1) Thrombin 

2) Plasmin *'l 2 x 10 * 

3) Plasmin (hp) 5.52 x 10 4 

4) Xa 6 - 75 x 10 4 
10 5) xa (hp) 9 * 43 x 10 3 

6) Urokinase f'f 4 x 10 < 

7) Urokinase (hp) J;iJ x 10 

8) Kallikrein 5 1 ? 0 

9) Cathepsin G 3 '*i x 10 

11) Activated protein C (hp) ,?? ,„< 

12) Elastase ■ 1.61 x 10 5 



20 



(hp) indicates in the presence of 10 jig/mi of heparin. 



TABLE in 
PN-1 (P ^Proj 
_ NCY 2031 



Protease Second Order Rate 

Constan t fM-'fi-') 

25 l) Thrombin 

2) Plasmin 1.92 x 10 4 

3) Plasmin (hp) ?'f 8 x 10 ! 

4) xa !- 58 x 10 4 

5) xa (hp) 1.41 x 10 2 
30 6) Urokinase 7.33 x io 2 

7) Urokinase (hp) 

8) Kallikrein 

9) Cathepsin G < ^°° 
10) Activated protein C Zt™ 

35 11) Activated protein C (hp) v in5 

12) Elastase 1.39 x 10 s 



(hp) indicates in the presence of 10 ng/ml of heparin. 
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TABLE IE 
NCY ?.07n 

5 Protease Second Order Rate 

— Constant fir's-') 

1) Thrombin 

2) Plasmin <10 ° 

3) Plasmin (hp) 

4) Xa 

10 5) xa (hp) 

6) Urokinase 

7) Urokinase (hp) 

8) Kallikrein ~~~~ 

9) Cathepsin 6 

15 10) Activated protein C 

11) Activated protein C (hp) 

12) Elastase 

1.20 x 10 6 



(hp) 



indicates in the presence of 10 M g/ml of heparin. 



20 



TABLE IF 
PN-1 rP,T1oX 
WCY amn 

Proteasp Second Order Rate 

Constant- fir's-') 

25- i) Thrombin 

2) Plasmin <10 ° 

3) Plasmin (hp) 

4) Xa 

5) xa (hp) 

30 6) Urokinase 

7) Urokinase (hp) 

8) Kallikrein 

9) Cathepsin G <10 ° 
10) Activated protein C <10 ° 

35 li) Activated protein c (hp) 

12) Elastase * P ' ~ 

4.15 x 10 6 



(hp) indicates in the presence of 10 M g/ml of heparin. 
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5 Protease 



10 



15 



TABLE in 

PN-l (PjLeul 
NCY 2011 , 



1) Thrombin 

2) Plasmin 

3) Plasmin (hp) 

4) Xa 

5) Xa (hp) 

6) Urokinase 

7) Urokinase (hp) 

8) Kallikrein 

9) Cathepsin G 

10) Activated protein C 

11) Activated protein C (hp) 

12) Elastase 1 F ' 



Second Order Rate 
Constat (M-l<H} 

<100 



<100 
<100 



1.65 x 10 6 



20 



(hp) indicates in the presence of io Mg/lnl of hep arin. 



TABLE 1H 
NCY am 9 



Protease 

25 l) Thrombin 

2) Plasmin 

3) Plasmin (hp) 

4) Xa 

5) Xa (hp) 
30 6) Urokinase 

7) Urokinase (hp) 

8) Kallikrein 

9) Cathepsin G 

10) Activated protein C 
35 li) Activated protein C (hp) 
12) Elastase 



Second Order Rate 
Consta nt fir's-') 

2.59 x 10 4 

1.01 x 10 5 
1.51 x 10 4 
<100 

1.16 x 10 4 

<100 
<100 
<100 

3.02 x 10 4 



(hp) indicates in the presence of 10 M g/ml of heparin. 
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TABLE IT 
PN-1 fP , Met 



5 Protease Second Order Rate 

Constant fM-'sr') 

1) Thrombin 

2) Plasmin 

3) Plasmin (hp) 

4) xa 

10 5) Xa (hp) 

6) Urokinase 

7) Urokinase (hp) 

8) Kallikrein T~~" . 

9) Cathepsin G 4.81 x 10 4 

10) Activated protein C f 100 

11) Activated protein C (hp) 

12) Elastase 

<100 



15 



(hp) indicates in the presence of 10 M g/ml of heparin. 

20 TYPe IT Variants: s^ r ; n Active e» lr 

Type II variants of the invention are produced in 
a manner similar to Type I variants. However, the active 
site of PN-l is modified in a manner so that it matches 
the sequence of the active site of another protease, and 

25 preferably another serpin. Examples of Type II variants 
of the present invention include the following: 
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NCY# 


Seruin 




Protease 




Nexin-1 


^ 2203 


PAI-l 


2204 


PAI-2 


2205 


PAI-3* 


2201 


ATIII 




a2- 


10 2206 


anti- 




plasmin 




Cl- 


2207 


inhib- 




itor 


15 


Kalli- 


2208 


krein 




BP 


2209 


(rat) 



2210 
20 2211 

2212 



- 43 - 
TABLE ft 

^ £ 2 £l ^ 

Leu- He- Ala- Arg S er- Ser- Pro- Pro 

S£ 2; if: Ste « 
jte- ip- at? 2S- ffi- K: 0 

lie- Ala- £1*- Arg Ser - S_ 
Ala- Met- ser- Arg Met- Ser- Leu- Ser 

Ser- Val- Ala- Arg Thr- Leu- Lea- Val 

lie- Leu- ser- Arg Arg- Thr- Ser- Leu OR 

£he- Ara- lie- Leu Ser- Arg- Arg- ^ 
alAT Ala- He- Pro- Met Ser- lie- Pro- Pro 

r^atSr" ^~ Ala " ««- I«- Iffi- filn 



alAC Leu- Jeu- ser- Ala Leu- Val- 61u _ Thr 

»« hcxx ffi: gf: g: a ^ g- g 



OR 



25 uro- 
2213 kinase 



Jcxnase Met- Thr.- ciy.- Arg Tiir- Gly- fjis- Gl^ 0 



30 ? G i?-P?o teS thG SeqU6nCe is Preferably followed by 

Underlining residues indicates a difference fro™ <-k 
natural PN-l sequence. "-merence from the 

35 The same methodology referred to above with 

respect to Type I variants can be used in the production 
of Type II variants. Further, the methodology disclosed 
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within the above-cited patents (incorporated herein by 
reference) can be used to produce Type II variants. The 
methodology is modified only by first determining the 
active site of another serpin. After determining the 
5 amino acid sequence of the active site of a different 
serpin and studying the activity of that different serpin 
it is possible to produce a Type II variant having a 
particular and desired changed activity as compared with 
the naturally occurring PN-l. 

10 Testing T ype TT Vari.nfc 

As indicated above, all of the variants of the 
present invention can be tested by determining their 
second order rate constants relative to a panel of 
proteases. As with the Type I variants, tests were 
15 carried out with a representative number of Type II 

variants, and the results obtained are put forth below in 
Tables 2 A and 2B. 



TABLE 2A 

2201 I AT TTT) 

20 

Protease Second Order Rate 

Constant fir's-') 

1) Thrombin 

2) Plasmin 1.69 x 10* 

3) Plasmin (hp) ** 74 x 10 * 
25 4) Xa 4 - 45 * 10 4 

5) Xa (hp) 4 -fl x loj 

6) Urokinase 2 '* 8 * ™ A 

7) Urokinase (hp) <10 ° 

8) Kallikrein """" 

30 9) Cathepsin G 9.49 x 10 4 

10) Activated protein C 

11) Activated protein C (hp) 

12) Elastase vt 



35 (hp) indicates in the presence of io pg/ml of heparin. 
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TABLE 2B 
2202 fHC TT) 

Protease Second Order Rate 

-Const™* f^-i) 

5 1) Thrombin 

2) Plasmin 3.36 x lo 3 

3) Plasmin (hp) <10 ° 

4) Xa 

5) Xa (hp) <10 ° 
10 6) Urokinase 

7) Urokinase (hp) <10 ° 

8) Kallikrein ~~" 

9) Cathepsin G . 
10) Activated protein C **fj x 10 

15 11) Activated protein C (hp) <10 ° 

12) Elastase 

<100 



20 



25 



(hp) indicates in the presence of io „g/ml of heparin. 

The activity of NCY 2202 (particularly in the 
presence of heparin) is analogous in some ways to that of 
NCY 2322 in that both inhibit catheprin G but not 
elastase. This is completely unexpected due to the 
similar substrate specificity. 

Type ITT Variants: Tncomorat^n ^ - S ub.,t^ 0 c- T ^ n - r 
Type III variants of the invention are produced by 
first determining the substrate sequence of a protease. 
The substrate sequences of some proteases which are known 
and which would be useful in connection with the present 
invention are put forth below in Table 3A. 
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TABLE 3 A 



PROTEASE 
Thrombin 

5 Factor Xa 

Factor XIa 
Plasmin 

10 

Urokinase 
tPA 

15 Cl-esterase 
Kallikrein 

Neutrophile elastase 



20 



Cathepsin G 
Pancreatic elastase 



SUbstral-o serpionnn 

D-Phe-Pip-Arg-pNA 
Ts-Gly-Pro-Ar g _ pNA 

bz-Ile-Glu (y 0 R) -Gly-Arg- P NA 
cbo-D-Arg-Gly-Arg-pNA 

Glu-Pro-Arg-pNA 

(D/L) -Val-Leu-Lys-pNA 
D-Val-Phe-Lys-pNA 
Ts-Gly-Pro-Lys-pNA 
Glu-Phe-Lys-pNA 

Glu-Gly-Arg-pNA 
Bz-Ala-Gly-Arg-pNA 

(D/L) -Ile-Pro-Arg-pNA 

z-Val-Gly-Arg-pNA 

(D/Bz) -Pro-Phe-Arg-pNA 

Glu-Pro-Val-pNA 
Ala-Ala-Pro-Val-pNA 

Ala-Ala-Pro-Phe-pNA 
Ala-Ala-Pro-Leu-pNA 

Ala-Ala-Ala-pNA 



Other substrate sequences can be determined by 
determining the best artificial small molecule peptide 
substrates (i.e. Ala-Ala-Pro-Phe-p N A) as determined by * 
25 and k„, or by examining the sequence of natural protein " 
substrates (e.g. fibrinogen for thrombin). 

After determining the amino acid sequence which a 
protease will bind to (i.e. its specific substrate 
sequence) , that sequence is used to replace all or a 
30 portion of the active site of PN-i. ExaBples of 
varxants are as follows: 
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10 



Record # NCY 



1 

2 



4 

5 
6 

7 
8 
9 



TABLE 3B 
mutatjjpffl sequence 



indicafri* rm 



NCY2301 
NCY2302 



anticoag. 



XaS IEGR — 

Fibrinogen DPLAGGGGVR — th^o^bin' 

inhibition 



NCY2303 


HMWK 


SPFR-SVQ 


NCY2310 


FPR 


FPR — 


NCY2311 


EPV 


EPV — 


NCY2321 


AAPF 


AAPF 


NCY2322 


AAPL 


AAPL— 


NCY2323 


AAPV 


AAPV — 


NCY2324 


AAPI 


AAPI — 



kallikrein 

inh. 
thrombin 
elastase 
cathepsin 

G. inh. 
Elastase 
Elastase 
Elastase 



specific Type in variants are 



15 Testing Type TTT Variant 
In Table 3B above, 
shown. Substrate sequences of proteases such as those 
shown within Table 3A are included within PN-1. These 
variants were tested against a panel of proteases, and 

20 the results are shown below in Tables 3C through 3J. 



TABLE 3C 
2301 fXa S) 



Protease 



Second Order Rate 
Constant fM-'fi- 1 ) 



25 l) Thrombin 

2) Plasmin 

3) Plasmin (hp) 

4) Xa 

5) Xa (hp) 
30 6) Urokinase 

7) Urokinase (hp) 

8) Kallikrein 

9) Cathepsin G 

10) Activated protein C 
35 11) Activated protein C (hp) 
12) Elastase 



<100 






2.45 


X 


10< 


2.54 


X 


10 4 


1.37 


X 


10 3 


6.83 


X 


10 3 


5.71 


X 


10 3 


<100 






<100 






<100 







(hp) indicates in the presence of 10 M g/ml of heparin. 



WO 95/11987 



PCT/US94/11624 



- 48 - 

TABLE 3D 
2303 fHMWXl 

Protease Second Order Rate 
Constan t fir'g-') 

5 1) Thrombin „ , 

2) Plasmin 2 ' 27 x lo3 

3) Plasmin (hp) <10 ° 

4) Xa 

5) Xa (hp) <10 ° 
10 6) Urokinase 

7) Urokinase (hp) <10 ° 

8) Kallikrein 

9) Cathepsin G 

10) Activated protein C 

15 11) Activated protein c (hp) 00 

12) Elastase 



20 



(hp) indicates in the presence of io Mg/ni i G f heparin. 

TABLE 3R 
2310 f FPR) 

Pro tea sb Second Order Rate 

Constant fir's-') 

1) Thrombin . 

2) Plasmin 4 - 13 x 10* 
25 3) Plasmin (hp) 5.12 x 10* 

4) Xa 1 ' 32 * 10 5 

5) Xa (hp) f' 30 x 10 2 

6) Urokinase 

7) Urokinase (hp) 5 ' 30 x 10 
30 8) Kallikrein ~~~~ 

9) Cathepsin G 

10) Activated protein C 

11) Activated protein C (hp) ? ?? < 

12) Elastase P ' *'_ 7 _ 4 _ x 10 * 

35 



(hp) indicates in the presence of 10 M g/ml of heparin. 
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TABLE 3F 
2321 fAAPF) 

Protease Second Order Rate 

Constant* fH" 1 ^ 1 ) 

5 1) Thrombin 

2) Plasmin 7.02 x 10 3 

3) Plasmin (hp) <100 

4) Xa 

5) Xa (hp) <100 
10 6) Urokinase 

7) Urokinase (hp) 

8) Kallikrein 

9) Cathepsin G <10 ° 
10) Activated protein c <10 ° 

15 11) Activated protein c fhn) 

12) Elastase 1 

<100 



(hp) 



indicates in the presence of 10 of hepar±m 



20 TABLE 3fi 

2322 ( AAPT.) 

Protease Second Order Rate 

Consta nt fir's-*) 

1) Thrombin 

2) Plasmin 

25 3) Plasmin (hp) 

4) Xa 

5) Xa (hp) 

6) Urokinase 

7) Urokinase (hp) 

30 8) Kallikrein 

9) Cathepsin G <100 

10) Activated protein C 4.0 x lo 5 

11) Activated protein C (hD) 

12) Elastase K y} 

<100 



35 



(hp) indicates in the presence of io » g/ml of heparin. 
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TABLE 3H 
2324 fAAPT) 

Protease Second Order Rate 

_ Constan t fur's ') 

5 1) Thrombin 

2) Plasmin 

3) Plasmin (hp) 

4) Xa 

5) Xa (hp) 

10 6) Urokinase 

7) Urokinase (hp) 

8) Kallikrein 

9) Cathepsin G <10 ° 

10) Activated protein c <10 ° 

11) Activated protein C (hp) 

12) Elastase ~ 

<100 



15 



(hp) 



indicates in the presence of io M g/inl of heparin. 



20 TABLE 3T 

2323 f ARPV) 

Protease Second Order Rate 

Constant rM-'fi' 1 ) 

1) Thrombin 

2) Plasmin 

25 3) Plasmin (hp) 

4) Xa 

5) Xa (hp) 

6) Urokinase 

7) Urokinase (hp) 

30 8) Kallikrein 

9) Cathepsin G <10 ° 

10) Activated protein C <10 ° 

11) Activated protein C (hp) 

12) Elastase 1 P) 

<100 



35 



(hp) indicates in the presence of io M g/ml of heparin. 
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TABLE 3.T 
2311 CF.vy) 



Protease 

5 1) Thrombin 

2) Plasmin 

3) Plasmin (hp) 

4) xa 

5) xa (hp) 
10 6) Urokinase 

7) Urokinase (hp) 

8) Kallikrein 

9) Cathepsin G 

10) Activated protein C 
15 11) Activated protein c (hp) 
12) Elastase " 



Second Order Rate 
Consen t fM->g-l) 

<100 



<iob 

<100 



<100 



25 



35 



(hp) indicates in the presence of io M g/ml of heparin. 
Type TV v» r <„fi., 

20 The Type IV variants of the invention are quite 

different from all of the other variants in that the 
amxno acid sequence may be the same as the naturally- 
occurring protein or different therefrom. Type iv 
variants may be generated by covalently binding (i. e 
coupling) PEG to any protein, protein fragment, or " 
peptide in general when increased biological stability is 
desired, of particular interest are those proteins 
protein fragments, and peptides which are useful in' 
therapeutic applications, such as serine protease 
inhibitor proteins, growth factors, and cytokines. The 
proteins PN-l, human growth hormone (hGH) , erythropoietin 
(EPO) , and antithrombin-lli (ATIII) are of particular 
interest. Specific exemplary proteins of interest, as 
well as exemplary classes of proteins, which can be 

modified to create a Type IV variant •,»,•„.. 

3V 1V variant using the methods of 

the invention are provided in Table 4A. 
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10 



15 



20 



25 



TABLE 4A 

Exemplary Proteins and Protein c.<i* m a for g^*^*^ 
of Type TV Variant-g 

' SPECIFIC EXEMPLARY PROTEINS 



5 interf eron-a2A 



interf eron-a2B 



insulin-like growth 
factor-1 (IGF-l) 



insulin 



human growth hormone (hGH) transforming growth factor 

1 (TGF) 



erythropoietin (EP0) 



thr ombopo ietin { TPO ) 



ciliary neurite 
transforming factor (CNTF) 



brain-derived neurite 
factor (BDNF) 



IL-l 



i nsu 1 in t r op in 



IL-2 



glial-derived neurite 
factor (GDNF) * 



IL-l RA 



tissue plasminogen 
activator (tPA) 



superoxide dismutase (SOD) | urokinase 



catalase 



streptokinase 



fibroblast growth factor 
(FGF) (acidic or basic) 



hemoglobin 



neurite growth factor 
(NGF) 



adenosine deamidase 



granulocyte macrophage 
colony stimulating factor 
(GM-CSF) 



bovine growth hormone 
(BGH) 



granulocyte colony 
stimulating factor (G- 
CSF) 



calcitonin 



platelet derived growth 
factor (PDGF) 



L-asparaginase 



bactericidal/permeability 
increasing protein (BPI) 



arginase 



uricase 



phenylalanine 



7-interf er on 



ammonia lyase 
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TABLE 4 A (cont.) 




chemokines 
gonadotropins 



chemotactins 



iamunoglobuli ns 
interleukins 



•t. V 



15 



lipid-binding proteins 
* GDNF i s the same protein as protease nexin-l (PN-l) . 

Type iv variants are created by attaching 
polyethylene glycol to a thio group on a cysteine residue 
of the protein. The PEG moiety attached to the protein 
may range in molecular weight from about 200 to 
20 000 MW. Preferably the PEG moiety will be from about 
1,000 to 8,000 MW, more preferably from about 3,250 to 
5,000, most preferably 5,000 MW. 

General methods of attaching polyethylene glycol 
to protexns are disclosed within U.S. Patent 4,179,337 
20 xssued December 18, 1979 (incorporated herein by 

reference to disclose methods of attaching polyethylene 

IT ^ Pr ° teinS) ' ^ ther ' ° ther meth ° ds of attaching 
polyethylene glycol are disclosed within U.S. Patent 

5,122 614 issued June it, 1992 , also incorporated herein 
25 by reference to disclose methods of attaching 
polyethylene glycol to proteins. 

The present inventors have discovered that novel 
modified proteins can be created by attaching the 
polyethylene glycol to a cysteine residue within the 
30 protein. Preferably, the protein is modified by 

attaching the polyethylene glycol to a cysteine residue 
at a position which is normally glycosylated. i„ another 
preferred embodiment, a cysteine residue is added at a 
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position which is normally glycosylated (e.g. 
N-glycosylated) , and the polyethylene glycol is attached 
to the thio group of the added cysteine residue, m 
addition, if the protein of interest is one member of a 
5 family of structurally related proteins, glycosylation 
sites for any other member can be matched to an amino 
acid on the protein of interest, and that amino acid 
changed to cysteine for attachment of the polyethylene 
glycol. Alternatively, if a crystal structure has been 
10 determined for the protein of interest or a related 
protein, surface residues away from the active site or 
binding site can be changed to cysteine for the 
attachment of polyethylene glycol. 

It has also been found that it is possible to 
attach other groups to the thio group of the cysteine 
residue. For example, the protein may be biotinylated by 
attaching biotin to a thio group of a cysteine residue. 
Examples of Type IV variants are as follows: 



15 



20 



25 



30 



Record 
1 
2 
3 
4 



NCY 
NCY2601 
NCY2611 
NCY2621 
NCY2631 



TABLE 4 

mutation sequence indication 
PEG-PNl lys-modified 
Biotin-PNl lys-modified 
PEG-PNl cys-modified 
Biotin-PNl cys-modified 



35 



long 

half-life 
detection/ 

coupling 
long 

half-life 
detection/ 
coupling 

Of the above examples, records numbers 1 and 2 are of a 
general type known in the art in that the polyethylene 
glycol or biotin is attached to lysine position of the 
peptide. However, record numbers 3 and 4 are, 
respectively, examples wherein the polyethylene glycol or 
biotin are connected at a cysteine group. The importance 
of such is described further below. 
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The addition of polyethylene gly col (PEG) to 
proteins as a common method to increase serum half-i lfe 
and decrease immunogenicity and antigenicity 

5 1Z1L B 1 d6SCrlbe S6Veral Pr ° teinS which have been 
5 modified by addition of peg including adenosine 

deamidase, L-asparaginase for treatment of acute 
lymphoblastic leukemia, interferon alpha 2b (iFK-^b, as 
an -"cancer and antlviral ^ superoxi(Je 

10 Z antiCanC6r drU9 ' streptokinase, tPA, urokinase, 
10 uracase, hemoglobin, interleukins, interferons, tgfh? 
EGF, and other growth factors (Nucci et al., 1991f J v 

Drug Delivery Rev. 6:133-151) . 

Typically, this modification of a protein by 

15 fun\ ^ ° f 3 B ° iety inV ° IVeS -tivating PEG with a 
15 functional group which will react with lysine residues on 
the u rface of the protein if ^ modif . cation ~ on 

protexn goes to completion, the activity of the protein 

on 5 !™ 1 1 ° St ' Procedures aiL 

only partxal PEGylation of the protein. Usually this 
20 results in only 50% loss of activity and greatly 

increased serum half-life, so that the overall dose of 
the protean reguired for the desired activity is lower 

is thatT/"" 01 ^ 16 reSUlt ° f ^ Partial »°<^icatio„ 
xs that there wall be a statistical distribution of the 
25 number of PEG groups per protein, each PEG attached to a 
lysane residue. Also, there will be a random usage of 
lysine residues on the surface. For instance, when 
adenosine deaminase is optimally modified, there is a 
loss of 50% activity when the protein has about 14 peg 
per protein, with a broad distribution of actual PEG per 
mdavadual protein and a broad distribution of actual 
lysane residues used. 

Early work on PEG modification relied on 
activating peg with cyanuric chloride and the coupling 
35 with proteins. This approach suffers from several 
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disadvantages, most noteworthy are the toxicity of 
cyanuric chloride and loss of activity. There have been 
significant advances recently to reduce the toxicity of 
the chemicals involved in the PEGylation reactions 
5 Zalipsky, et al. (Zalipsky, s., Seltezer, R. * Nho * K 
(1991) P°lY°eric nnms nna p^iv^ c y ^„__ 

describe the use of succinimydyl carbonates of PEG to 
produce stable carbamate linkages with proteins via free 
lysine residues. There are several other methods for 
0 modification of proteins with PEG through free lysine 
residues, but each suffers from the problems associated 
with partial, random modification of proteins and the 
potential for losing activity if lysine residues are 
important. 

5 The current state of the art of PEGylation 

technology could be greatly improved upon if the reaction 
could be made more specific. For instance, if the 
reactive region of the protein could be blocked during 

. iiTact 1 ::; ifc is expected ttat the pr ° tein w ° uid * 

Mil activity even after saturation modification. This 
may prove to be expensive and inconvenient (except if the 
modification took place during an affinity purification 
step where the active site is protected, and the protein 
xs purified all in one step) . In some cases to block ^ 
active site region would lead to irrecoverable loss of 
activity, and so this method could not be employed. 

Another alternative is to PEG-modify other 
residues such as His, Trp, cys, Asp, Glu, etc. in such a 
manner that activity is not lost, it is anticipated or 
Ixkely that many of these residues will be at or near the 
active site or that these residues will not be at the 
surface or in sufficient number to significantly affect 
serum half-life, or that the modification chemistry is 
not specific enough for the target residue, or the 
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without loss of activity. 



The most desirable situation would be if there 
-ere a surface distribution of an amino acid which Is 

ITZMZ r "V* 1 "- eMily "* ^"^"y and 
« away from the active site region. Cysteine is an 

ideal candidate for such modification since it s r " ely 
used at or near the active site (except cysteine " 
proteases) and modification chemistry has been well 
10 -tabUshed. ^eimido-PBC ls perhaps ^ ^ 

oa ca„„„. uslng sate-airected mutagenesis, new 
cysteme residues can be added on the surface aw,! , 
the active site region. This is ma,* rfa ° e " aWay fro "> 
15 .«(,.. . , nls 18 most conveniently and 

easily done if a x-ray crystal structure is Known! 

Even with this protein engineering strategy it i B 
not easily *„own a ^orl which surface residues " 
change to cysteine and how many PEG modification sites to 
add in order to increase stability sufficiently. There 

ind I Strate " eS ' — • «- ^ create 2 o or more 
independent singi.-site cysteine mutations, PEGyla J e „ ch 
independently and analy,e each for remaining activity 
Mutant proteins which retaln suf£icie „ t ac J 

35 aT " 9Cnerate ^ Pr0tel " " iU > mu ^tiple PEG 

25 attachment sites. 

„ raere U an ° ther Etrate 9V to identify good PEG 
attachment sites which has the advantage that one does 

and r^ 1 ! 6 *" 0 " 1 ^ ° f «- ««. dimensional structure 
and also takes advantage of the selective power of 

aT o?° n ^ ^ Sit ' S - " atU " - *~» to 

add glycosylate residues to the surface of secreted 

proteins to aid stability and increase serum half-life 

For example, asparagine residues are glycosylated when' 

Replacement of these »sn residues by cysteine, followed 
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by cyst ine-specific PEGylation is expected to lead t 
proteins with significantly increased serum half -life 
with a minimum loss in activity. To date, this is the 
closest thing to in vitro glycosylation, and in some 
5 respects may be better since glycosylation with 
inappropriate sugar residues may lead to increased 
clearance from the serum by the liver, whereas PEG 
residues are expected to be rather inert. 

If a higher degree of PEG modification is desired 
10 and the protein to be modified is one member of a family 
of structurally related proteins, other members of the 
same family will often have one or more sites of 
glycosylation not found in the protein of interest, if 
these "new" potential sites are in a region which is 
15 reasonably conserved (i.e. not part of an insertion or 
with a sequence which is so different that it is likely 
to have a different structure) it is expected that 
replacement of the residue equivalent to the Asn with 
cysteine followed by PEGylation will result in a more 
20 highly PEG modified protein without significant loss in 
activity. 

If a further higher degree of PEG modification is 
required, other solvent accessible residues can be 
changed to cysteine, and the resultant protein subjected 

25 to PEGylation. Appropriate residues can easily be 

determined by those skilled in the art. For instance, if 
a three-dimensional structure is available for the 
protein of interest, or a related protein, solvent 
accessible amino acids are easily identified. Also, 

30 charged amino acids such as Lys, Arg, Asp and Glu are 
almost exclusively found on the surface of proteins. 
Substitution of one, two or many of these residues with 
cysteine will provide additional sites for PEG 
attachment. In addition, amino acid sequences in the 

35 native protein which are recognized by antibodies are 
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usually on the surface of the protein. These and other 
methods for determining solvent accessible amino acids 
are well known to those skilled in the art. 

Modification of proteins with PEG can also be used 
to generate dimers and multimeric complexes of proteins 
fragments, and/or peptides which have increased 
biological stability potency. These diineric and 
multimeric proteins of the invention may be naturally 
occurring dimeric or multimeric proteins. For example 
the dimer or multimer may be composed of cross-linked ' 
subunits of a protein (e.g., hemoglobin). Alternatively 
the dimeric and multimeric proteins may be composed of ' 
two proteins which are not normally cross-linked (e.g a 
dimer of cross-linked EPO protein. ' 

DWic proteins of the invention may be produced 
by reacting the protein with (Maleimido) 2 -P EG , a reagent 
composed of PEG having two protein-reactive moieties 
This PEGylation reaction with the bi-functional PEG 
moiety generates dimers of the general formula: 

R,-S-PEG-S-R2 

where R, and R 2 may represent the same or different 
proteins and S represents the thio group of a cysteine 
either present in the native R, or R 2 protein, or 
introduced by site-directed mutagenesis. The proteins R 
25 and R 2 may each vary in size from about 6 to 1,000 amino' 
acids, preferably about 20 to 400 amino acids, more 
preferably 40 to 200 amino acids. m dimeric molecules 
R! and R2 are preferably from about 100 to 200 amino 
acids. Dimers and mul timers of particular interest 
30 include those composed of proteins, protein fragments 
and/or peptides which are less than about 40,000 
molecular weight. 

Where the protein contains (or is engineered to 
contain) more than one free cysteine, multimeric proteins 



20 
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where the proteins (represented by R,, Rj, RJ may be 

the same or different can be generated. The proteins 
represented by R,, r, ^ Da y vary in size from about 

6 to 1,000 amino acids, preferably 20 to 400 amino acids, 
5 more preferably 40 to 200 amino acids, such multimeric 
proteins may be of the following general formula: 

R,-(-S-PEG-S-R2) n 
where R, represents the protein having multiple free 
cysteines, for example from 2 to 20 free cysteines, 
10 usually from 5 to 7 free cysteines. R 2 may represent a 
protein the same as or different from R, . Furthermore, 
each of the R 2 proteins attached to R, may be the same ' 
proteins, or represent several different proteins. 

The degree of multimeric cross-linking can be 
15 controlled by the number of cysteines either present 
and/ or engineering into the protein and by the 
concentration of (Maleimido) 2 PEG used in the reaction 
mixture, in addition, the Maleimido-PEG+ (Maleimido) 2 -PEG 
reagents may be used in the same reaction with proteins 
20 for formation of couplings of proteins having simple PEG 
moieties as well as PEG cross-links between proteins 
within the complex. The dimeric or multimeric protein 
generated will have an increased half-life relative to 
the native, protein, due at least in part to its 
25 increased size relative to the native protein. Such 
larger proteins are not degraded or cleared from the 
circulation by the kidneys as quickly as are smaller 
proteins, in addition, activity or potency of the 
dimeric or multimeric protein may be increased. 
30 Dimeric and multimeric proteins may be generated 

by reaction with Maleimido-PEG or (Maleimido) 2 -PEG. 
Exemplary proteins for dimeric and/or multimeric complex 
formation using the method of the invention include PN-i, 
PN-l variants, hemoglobin, and erythropoietin (EPO) , as 
35 well as any of the proteins or members of the protein 
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classes exemplified in Table 4A. Preferably, the protein 
generated by the method of the invention is a PEGylated 
cross-linked complex of the -a- and »b» chains of 
hemoglobin, multimeric complexes of hemoglobin having 
5 intermolecular and/or intramolecular cross-links may also 
be generated by the subject method. 

in general, the method of identifying cysteine 
residues for PEG modification, and/or identifying amino 
acid residues to be replaced with cysteine which are 
10 subsequently modified by attachment of peg, provides for 
generation of a PEGylated protein which can be reasonably 
expected to retain most or all of the activity of the 
native protein. The sites selected for modification 
and/or substitution with cysteine are selected on the 
15 basis of the structure of the protein, i.e. the selected 
sites are solvent accessible residues which are not 
involved in the active site. 

The effect of mutations located outside of the 
active site are generally predictable in that they 
20 generally do not change the primary activity of the 

protein, m addition, the structural mutations described 
herein are within solvent-accessible regions of the 
protein (i.e. on the protein -surface") which have 
limited or no interaction with other residues in the 

ZTit i T leCUle ' ThUS ' mUtations at ««se Positions are 
, unlikely to affect the conformation of any other amino 
acid in a protein. 

TVDe V Vari^fe 

Type V variants of the invention are produced by 
30 fusing all or a fragment of another protein to PN-l 

Preferably, the amino terminal fragment of a protein such 
as urokinase is fused to PN-l in order to localize PN-l 
to a different receptor, i.e. to the urokinase receptor. 
In addition to using urokinase, it is possible to fuse 
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the amino terminal fragment f proteins such as tPA 
Factor IX, Factor X, and Protein c. Examples of Ty^e V 
variants are as follows: 



TABLE 5 



10 



15 



Record if 
1 
2 
3 
4 
5 
6 



mi 


putatiop 


sequence 


NCY2501 


ATF-PN1 


urokinaseATF 


NCY2502 


HSA-PN1 


HSA chimera 


NCY2503 


IgG-PNl 


IgG chimera 


NCY2504 


F9-PN1 


Factor IX 


NCY2505 




chimera 


F10-PN1 


Factor X 


NCY2506 




chimera 


APC-PN1 


Protein c 






chimera 



indicate 



20 



metastasis 
long 

half-life 
long 

half-life 
anti- 
coagulation 
anti- 
coagulation 
pro- 
coagulant 

The type V variants of the invention can be 
produced in manner similar to variants of type I u and 
III. However, it is more preferable to produce such 
variants by chemically fusing an N-terminal fragment of a 
different protein to the PN-1. 

The Type V variants can also be produced by 
chemical linkage of purified preparations of both protein 
components. Such linkage is conveniently accomplished by 
usxng bi-functional cross-linking reagents. Methods for 
chemically establishing such linkages are well known to 
those skilled in the art. Specific variants which might 
be produced in this manner include variants produced by 
30 fusing PN-l to any one of: EGF; Factor IX; Factor X; and 
APC. ' 

The method of PEGylation of the invention is 
intended to be a general procedure and as such is 
applicable to any protein to increase solubility 
35 circulating half life and/or to decrease immunogencity . 



25 
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Testing Type V Variants 

The Type V variants of the invention are chimeric 
proteins wherein PN-1 is covalently bound to another 
protein. The sane panel of proteases indicated above 
5 were used to test one of the Type V variants of the 
invention, and the results are put forth below. 



TABLE 5A 
2501 fATF-PN-1) 

10 Protease Second Order Rate 
xu rrorease constant f*f'c-i) 

1) Thrombin 5 5 

2) Plasmin f J * J° 

3) Plasmin (hp) x 10 

4) Xa 

15 5) Xa (hp) 

6) Urokinase 

7) Urokinase (hp) 

8) Kallikrein 

9) Cathepsin G 
20 10) Activated protein C 

11) Activated protein C (hp) 

12) Elastase 



2.3 x 10 s 



(hp) indicates in the presence of 10 Mg/ml of heparin. 

25 Use and Administration 

The different chimeric proteins, PN-l variants, 
and cysteine-PEGylated proteins of the invention (as 
indicated above) can provide different effects. For 
example, P, variants with non-polar residues such as 

30 valine substituted for the polar arginine residue could 
be used as heparin activatable inhibitors. Such 
inhibitors could be used to treat individuals suffering 
from elastase-related diseases. Although not limited to 
such diseases, such variants could be used to treat 
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35 



emphysema, congenital a-l-antitrypsin deficiency, 
inflammation, arthritis and septic shock. 

One of the most important and immediate perceived 
uses of the chimeric proteins, pn-1 variants, and/or 
5 cysteine-PEGylated versions of these proteins of the 

invention is the inclusion of such within various topical 
formulations such as creams or gels, or a combination of 
such formulations, with various bandages for application 
to wounds to aid in wound healing and decrease 
10 inflammation at wound sites. i„ that chimeric proteins, 
PN-1 variants, and cysteine-PEGylated versions of these 
proteins of the invention are believed t « be e~~ ctiv- - 
decreasing inflammation, injectable formulations^ " " 
containing the chimeric proteins, pn-i variants, a „ d /or 
15 cysteine-PEGylated versions of these proteins of the 
invention may be injected directly into inflated joints 

11° fT in ? med ° f ^ ^ *> brea- 

the inflammation. Further the formulations of the 

0 chir ti0n ^ Pr0ph ^ acti -^y ^ providing the 

0 chimeric proteins, PN -i variants, and/or 

cysteine-PEGylated versions of these proteins to a 
particular site which may be subjected to trauma, (such 
as m surgery,, and thus inflammation, to prevent the 
inflammation from occurring. 
5 Generally, the pharmaceutical compositions 

containing the chimeric proteins, PN -i variants, and/or 
cysteine-PEGylated proteins of the invention will be 
formulated in a non-toxic, inert, pharmaceutical ly 
acceptable aqueous carrier medium, preferably at a p H of 
' about 5 to 8, more preferably 6 to 8, although the 
preferred P H of the pharmaceutical composition may vary 
according to the protein employed and condition to be 
treated . 

Of particular interest in the present invention 
are cysteine-PEGylated proteins and pharmaceutical 
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c positions containing these pr tein. PEGylation of 
proteins generates proteins which are ready for immediate 
therapeutic use (i.e. do not require ^constitution, , 
have increased solubility and have an increased half -i if e 
5 and are reduced in immunogenicity and antigenicity 
relative to the unmodified protein (Nucci et al 1991 
*Tv. *rug Delivery res. 6:133-151,. The increased half- 
life of PEGylated proteins decreases the amount of 
protein needed for an effective dosage, reduces the 
10 number and frequency of administrations required, and 
decreases the patient's exposure to the protein, thus 
decreasing the potential for allergic reactions, toxic 

PECvIatV" effeCt ' Th6Se Ch — ^istics of 

15 IIZ T S all ° W l0n ^ te ™ of th. protein 

ttZT POtCntial -"ects related 

to protein immunogenicity and/or toxicity. Exemplary 
proteins for which an increase half-life has effected by 
PEGylation of the protein include: hGH, insulin 

20 ;^ rf ^ K ° n - al P ha2A (I™-alpha-2A), interferon-alpha2B 
20 (IFN-al P ha-2B,, tPA, EPO, G - CSF , antigen E, arginase 
asparaginase, adenosine deaminase, batroxobin, bovine 
serum albumin, catalase, elastase, factor viu 
galactosidase, alpha-galactosidase, beta-glucuronidase, 
IgG, honeybee venom, hemoglobin, interleukin-2 , lipase 
25 phenylalanine ammonia lyase, alpha.-proteinase inhibitor 
pro-urokinase, purine nucleoside phosphorylase, ragweed ' 
allergen, streptokinase, superoxide dismutase, tPA D- 
alpha-tocopherol, trypsin, tryptophanase, uricase, 'and 
urokinase (see in general Nucci et al. ibid.; see also 
30 Davis et al. 1981 Clin. Exp. Immunol. 46:649-652 (bovine 
adenosine deaminase); Nishimura, et al. i 98 5 Life Sci 
33:1467-1473 (batroxobin,; savoca et al. 1979 Biochimica 
et Biophysica ACTA 578:47-53 (arginase); Till, et al 
1983 J. T raun, a 23:269-277 (asparaginase); Veronese, et 
al. 1983 J. Pharm. Pharmacol. 35:281-283 (superoxide 
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dismutas ); Davis et al. 1981 Lancet 2:281-283 (urate 
oxidase); and Dellinger et al. 1976 Cancer 38:1843-1846) 

PEGylated proteins may be administered for the 
treatment of a wide variety of diseases. Exemplary 
disease conditions and the proteins useful in treatment 
of these diseases are provided in Table 6A. 



10 



Rnxrae Defici^m^ 
adenosine deaminase] 
Purine nucleotide 
phosphorylase 
Galactosidase 
^-glucuronidase 

Antioxidants for cancer Th ftynrv 
Superoxide dismutae " 
Catalase 



Endotoxi c Shorlr ^op^. 

Bactericidal/permeability 
increasing protein 
Lipid-binding protein (LBP) 



Blood Protein R ft pl n ^ nt 

Therapy 

Hemoglobin 

Albumin 



healing, inducting of r «* h^^T 
cell formation, ~*rTJ 23 

Epidermal growth factor 
G-CSP 

Interferon— y 

Transforming growth factor 
EPO 

Thrompoietin 

Insulin-like growth factor-1 
Insulin 
hGH 



Cancer 

Interferon-a 
Interferon-y 
ILl-a 

Phenylalanine ammonia lyase 
Arginase 
L-asparaginase 
Uricase 

Granulocyte colony 
stimulating factor (GCSF) 
Monoclonal antibodies 
Tissue necrosis factor 

Cardiova scular Pi 
Tissue Plasminogen 
Activator 

Streptokinase (native or 
chimeric) 

Urokinase (native or chimeric) 
a-antitrypsin 
ant it hrombin-1 1 1 
Other proteases or protease 
, inhibitors 

48j* lipoptoreins < particularl y fi - 

Circulating Scavenger Receptor 
APO Alj 

For treatment of severe combined ijnmunodef lciency 
45 I » ^"vertB low-density lipoproteins to hloh-denslty lipoprotein. 
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As discussed above, chimeric proteins, pn-1 
variants, and cysteine-PEGylated proteins may be 
delivered within the formulations and by the routes of 
administration discussed above. The particular 
5 formulation, exact dosage, and route of administration 
will be determined by the attending physician and will 
vary according to each specific situation. Such 
determinations are made by considering such variables as 
the condition to be treated, the protein to be 
10 administered, the pharmacokinetic profile of the 

particular protein, as well a various factors which may 
modify the effectiveness of the protein as a drug, such 
as disease state (e.g. severity) of the patient, age 
weight, gender, diet, time of administration, drug 
L5 combination, reaction sensitivities, tolerance to 

therapy, and response to therapy. Long-acting protein 
drugs might only be administered every 3 to 4 days, every 
week or once every two weeks, where cysteine-PEGylated 
proteins are used in the pharmaceutical composition, the 
clearance rate (i.e. the half-life of the protein) can be 
varxed to give ultimate flexibility to fit the particular 
need of the patient by changing, for example, the number 
of PEG moieties on the cysteine-protein, the size of the 
PEG moiety 

Where cysteine-PEGylated proteins are employed 
the daily regimen should generally be in the range of the 
dosage for the natural, recombinant, or PEGylated 
protein. Normal dosage amounts may vary from 0.1 to loo 
micrograms, up to a total dose of about l g, depending 
upon the route of administration. Guidance as to 
particular dosages for particular proteins is provided in 
the literature with respect to the administration of 
either native proteins and/or proteins PEGylated by 
conventional methodologies. For example, guidance for 
administration of antithrombin-IIi for the prevention of 
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5,182,259, guxdance for administration of human growth 
hormone (hCH, in the treatment of individuals intoxicated 
with poisonous substances may be found in USPN S 5,140 008 

5 and 4,816,439; guidance for administration of ^InZ 
treatment of topical ulcers may be found in USPN 
5,006,509; guidance for administration of (E p 0 ) for 
treatment of anemia and pulmonary administration of EPO 

■ »ay be found in USPN 5,354,934; guidance for 

administration of EPO, gm-csf, g-CSF, and multi-csF for 
treatment of pancytopenia may be found in USPN 5,198,417- 
guidance for administration of epo for treating iron 

ZlT, T ^ f ° Und ^ USPN 5 '° 13 ' 718 '- ^nce for 
administration of EPO in the treatment of 

15 hemoglobinopathies may be found in uspn 4,965 251- 

ZlZl ° f the treatment of 

deii! T bS f ° Und in USPN guidance for 

denary of asparaginase for treatment of neoplasms may 
be found ln USPNg 4>478 , 822 and ay 

20 administration of L-asparaginase in the treatment of 
tumors is found in USPN 5,290,773; guidance for 
administration of prostaglandin El, prostaglandin E2, 
prostaglandin F2 alpha, prostaglandin 12, pepsin 
pancreatin, rennin, papain, trypsin, pancrelipase, 

5 chymopapain, bromelain, chymotrypsin, streptokinase, 
urokinase, tissue plasminogen activator, fibrinolysin 
deoxyribonuclease, sutilains, collagenase, asparaginase, 

tral ? 3 Cry0961 banda9e f ° r treat * ent ° f -it- of 

trauma may be found in USPN 5,260,066; guidance for the 

3 administration of superoxide dismutase 

glucocerebrosidase, asparaginase, adenosine deaminase 
interferons (alpha, beta, and gamma,, interleukin ' 

W^lll: 5 ' 6 '*' USSUe neCr ° SiS faCt ° r < TOF -alpha or 
TNF-beta) , and colony stimulating factors (CSF, g-CSF 

GM-CSF, in liposomes may be found in USPN 5,225 212- ' 
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guidance for administration of asparaginase or insulin in 
the treatment of neoplastic lesions may be found in USPN 
4,978,332; guidance for administration of asparaginase i„ 
the reduction of tumor growth may be found in USPN 
5 4,863,910; guidance for the administration of antibodies 
xn the prevention of transplant rejection may be found in 
USPNs 4,657,760 and 5,654,210? guidance for the 
administration of interleukin-l as a therapy for 
immunomodulatory conditions including T cell mutagenesis 
10 induction of cytotoxic T cells, augmentation of natural ' 
killer cell activity, induction of interf eron-gamma , 
restoration or enhancement of cellular immunity, and 
augmentation of cell-mediated anti-tumor activity may be 
found in USPN 5,206,344; guidance for the administration 
15 of interleukin-2 in the treatment of tumors may be found 
in USPN 4,690,915; and guidance for administration of 
mterleukin-3 in the stimulation of hematopoiesis, as a 
cancer chemotherapy, and in the treatment of immune 
disorders may be found in USPN 5,166,322. 
20 All U.S. patents cited hereinabove are 

incorporated herein by reference with respect to the 
guidance provided in administration of the particular 
protein and/or PEGylated protein described therein. 

Several PEGylated proteins have already been 
25 approved for use by the U.S. Pood and Drug Administration 
(PDA) . These PEGylated proteins include: hGH, insulin 
interferon-alpha2A, interferon-alpha2B, tPA, EPO, g-CSF, 
and a hepatitis B vaccine which contains PEGylated 
proteins (Nucci et al. ibid). 
'0 Due to the usefulness of PEGylated proteins in 

therapy, there is a clear need for a method of generating 
PEGylated proteins at specific sites and which allows for 
the precise selection of amino acid residues for 
PEGylation, thus increasing the likelihood of generation 
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of PEGylated proteins which retain the activity of the 
unmodified parent protein. 

It is pointed out that pn-i is not found in 
significant quantities in plasma and may function 
5 primarily in tissues. The high affinity heparin binding 
site of PN-l appears to serve to localize PN-l to 
connective tissues and cells which contain sulfated 
proteoglycans on their surface and surrounding 
extracellular matrix. Thus, the primary role of pn-i 
10 seems to be in regulating proteolytic activity in tissues 
as opposed to blood. In that PN-l is found in brain 
tissue another aspect of the invention involves 
delivering formulations of the invention containing PN-l 
variants or chimeric proteins in order to facilitate 
15 peripheral or central nerve regeneration. Formulations, 
routes of administration and dosages for use of PN-l in' 
the treatment of inflammation and wounds are described in 
USPNs 5,206,017; 5,196,196; and 5,112,608; each of which 
are incorporated herein by reference to the extent that 
20 such methods of treatment using PN-l are described. 

It is generally not possible to obtain desirable 
results by administering large protein compounds such as 
chimeric proteins or protease nexin-1 and its variants by 
oral delivery systems, such proteins are generally 
digested in the GI tract (unless formulated with special 
carriers) and do not enter the cardiovascular system in 
their original forms due to such digestion. Chimeric 
proteins, PN-l variants, and/or cysteine-PEGylated 
versions of these proteins can be administered by any 
type of injection, such as intramuscular or intravenous, 
thus avoiding the GI tract, other modes of 
administration include transdermal and transmucosal 
administrations provided by patches and/or topical cream 
compositions. Transmucosal administrations can include 
nasal spray formulations which include the chimeric 
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proteins, protease nexin-l variant, and/or 
cysteine-PEGylated proteins within a nasal formulation 
which contacts the nasal membranes and diffuses through 
those membranes directly into the cardiovascular systL 
5 PEGylated proteins may have an increased ability to crl 
membranes and thus may enter the body more easily 
Formulations which include the chimeric proteins, pn-1 
variants within aerosols for intrapulmonary deliver are 
also contemplated by this invention, as « 
10 delivery systems wherein the chimeric proteins or PN-l 
variants are included within ophthalmic formulations for 
delivery m the form of eye drops. 

Any of the above suggested means of administration 

z^iz ide * in a variety ° f 

The formulations can be designed to provide the chimeric 
proteins or PN-l variants systemically or to a particular 
site Further, the formulations can be designed so as t" 
provide the chimeric proteins or PN-a varian L as J'J* 
as possible or in a sustained release or timed release" 
20 manner. For example, topical formulations could be 

created whereby the chimeric proteins or PN-l variants of 

tLicaT inCOrp ° rated or *^rsed throughout 

topic al polymer formulations capable of slowly releasing 

25 21 , Pr ° teinS ° r PN_1 VariantS to a ™- site L 

25 order to continually aid in wound healing and in 

preventing inflammation. 

As indicated above, different formulations of the 
invention can be administered in a variety of different 
manners in order to introduce the chimeric proteins PN -i 
variants, and/or cysteine-PEGylated protein into the 
cardio vascular system. The chimeric proteins, PN -i 
variants, and/or cysteine-PEGylated proteins are 
administered for a variety of purposes which generally 

>5 l^l% t0 ' 6XamPle: blOCkin9 P rote °lvtic activity; 

'5 inhibition of tumor growth or metastasis; promotion of 
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wound h aling and/or nerve fiber regeneration; 
replacement therapy for protein-deficient states (e g 
diabetes); inhibition of bacterial, fungal, or viral ' 
growth; enhancement of the immune response; induction of 
5 maturation of bone marrow stem cells (e.g. in bone marrow 
transfers) ; regulation of blood clotting; treatment of 
inflammation; or treatment of bacterial sepsis and 
endotoxic shock; replacement of albumin or hemoglobin 
(e.g. to replace blood transfusions), m particular 
10 intravenous formulations containing the chimeric 
proteins, PN-l variants, and/or cysteine-PEGylated 
versions of these proteins are useful 
anti-thrombolytic effect and therefore can be 
administered to aid and a prevention and/or alleviation 
15 of strokes and/or heart attacks. 



20 



25 



EXAMPIjFS 

The following examples are provided so as to give 
those of ordinary skill in the art a complete disclosure 
and description of how to make and use the PN-l variants 
of the invention and are not intended to limit the scope 
of what the inventors regard as their invention. Efforts 
have been made to insure accuracy with respect to the 
specifics given such as the association rate constants 
and temperature but some experimental errors and 
deviations should be accounted for. with respect to the 
formulation examples, parts are parts by weight, and any 
temperature readings are in degrees centigrade and all 
experiments were carried out at or near atmospheric 
pressure. 
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EXAMPLE A 
The Svnt-ft esis of ph- ? 
PN-1 was purified to homogeneity from serum-free 
medium conditioned by human foreskin fibroblasts in 
5 microcarrier cultures by affinity chromatography on 
heparin-agarose, followed by gel exclusion 
chromatography, as described in detail by Scott R w et 
al., J Pipl ChP„ (1985) 260:7029-7034, incorporated ' 
herein by reference. Of course, other chromatographic 
supports which contain heparin for affinity binding or 
other matrix such as cm sepharose or S-sepharose can also 
be used. The purified protein shows an M, of 42-43 kd 
based on sedimentation equilibrium analysis, or of 47 kd 
estimated from gel-exclusion chromatography. The 
15 purified material shows the properties exhibited by PN -i 
when contained in conditioned medium, including formation 
of sodium dodecylsulfate-stable complexes with thrombin 
urokinase, and plasmin; inhibition of protease activity^ 
heparin-enhanced inhibition of thrombin; and cellular ' 
20 binding of protease-PN complexes in a heparin-sensitive 
reaction. The N-terminal amino acid sequence of the 
xsoiated, purified protease nexin was determined for the 
first 34 ammo acids to be: Ser-His-Phe-Asn-Pro-Leu-Ser- 
Leu-Glu-Glu-Leu-Gly-Ser-Asn-Thr-Gly-lle-Gln-Val-Phe-Asn- 
25 Gln-Ile-Val-Lys-Ser-Arg-Pro-His-Asp-Asn-Ile-Val-ile. 

The PN-l variants of the present invention can be 
synthesized by utilizing the pure FN-i which has been 
isolated and purified in the manner indicated above The 
variants can be obtained by cleaving the purified pn-i 
30 protein at the P, or P, • site and replacing the arginine 
serine or both residues at that site with the desired ' 
non-polar substitute residue. After replacement of the 
desired residue with the desired non-polar residue, the 
segments can be fused utilizing protocols known to those 
skilled m the art. Although such methodology could be 
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utilized in order to obtain the variants of the present 
invention, this methodology is somewhat cumbersome and is 
extremely limited, due to the very small amounts of pn-i 
which can be extracted and purified. Accordingly, 
although the above procedure could be utilized, ii i s not 
the preferred method of making PN-1 or the variants 
disclosed herein. PN-1 and its variants are generally 
produced utilizing recombinant technology, as described 
below. 



10 EXAMPLE B 

A Generalized Recombinant- fiY rn-K q sis of PM ,., 
Methods of producing protease nexin-l utilizing 
recombinant technology are disclosed within published 
European patent application 873049126 which published 
15 application is incorporated herein by reference to 

disclose recombinant technologies utilized in producing 
protease nexin-l. The procedure can be modified by those 
skilled in the art, reading this disclosure, to obtain 
PN-1 variants. 

20 cDNA encoding the complete human PN-l protein was 

obtained from a foreskin fibroblast DNA library. The 
retrieval of this clone took advantage of probes based on 
the amino acid sequence determined in the native protein 
The cloned cDNA is amenable to expression in recombinant 
cells of both procaryotic and eucaryotic organisms by 
excising the coding sequence from the carrier vector and 
ligating it into suitable expression systems. 

The PN-1 can be directly produced as a mature 
protein preceded by a Met N-terminal amino acid (which 
may or may not be processed, depending on the choice of 
expression systems) may be produced as a fusion protein 
to any desirable additional N-terminal or C-terminal 
sequence, or may be secreted as a mature protein when 
preceded by a signal sequence, either its own, or a 
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heterologous sequence provided by, for example, the known 
signal sequence associated with the bacterial-lactamase 
gene or with secreted human genes such as insulin or 
growth hormones. Means for providing suitable 
5 restriction sites at appropriate locations with respect 
to the desired coding sequence by site-directed 
mutagenesis are well understood, and the coding sequence 
can thus be provided with suitable sites for attachment 
to signal sequence or fusion sequence, or into expression 
10 vectors. 

If bacterial hosts are chosen, it is likely that 
the protein will be produced in nonglycosylated form, if 
the PN-l is produced intracellular ly as a -mature- 
protein, the N-terminal methionine may be only partially 
15 processed, or not processed at all. Thus, the protein 
produced may include the N-terminal Met. Modification of 
the protein produced either intracellular^ or as 
secreted from such bacterial host can be done by 
providing the polysaccharide substances, by refolding 
using techniques to sever and reform disulfide bonds or 
other post-translational ex vivo processing techniques. 
If the protein is produced in mammalian or other 
eucaryotic hosts, the cellular environment is such that 
post-translational processing can occur in 2iyo, and a 
glycosylated form of the protein is most likely produced. 

The recombinant cells are cultured under 
conditions suitable for the host in question, and the 
protein is recovered from the cellular lysate or from the 
medium, as determined by mode of expression. 
Purification of the protein can be achieved using methods 
similar to that disclosed by Scott, R.w. et al., J_£iel 
Chem (supra) , or by other means known in the art. 

Once DNA segments coding for the production of 
PN-l have been inserted into bacterial hosts, multiple 
copies of the segments can, of course, be cloned by 
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growing the bacteria. The segments can be extracted from 
the bacteria by the use of conventional methodology 
whereby the DNA is extracted by subjecting disrupted 
cells to centrifugation and then subjecting the extracted 
5 DNA to enzyme digestion, which will result in obtaining 
the desired segments by subjecting the digested DNA to 
separation processes such as gel electrophoresis and 
blotting. The segments coding for the production of PN-l 
can then be subjected to conventional recombinant 
10 methodologies in order to substitute codons coding for 
the arginine and/or serine with new codons which code for 
the production of the desired non-polar amino acid 
residue. Once such recombinant segments are produced, 
they can be reinserted into vectors and hosts in the ' 
15 manner described above in order to obtain the production 
of the desired PN-l variants, a variety of vector and 
host systems known to those skilled in the art can be 



used. 



In addition, it is pointed out that PN-l variants 
20 might be made by using recombinant^ produced PN-l and 
then substituting only the desired «R« group (e.g., - 0H 
of serine 346) with a non-polar «R« group (e.g., 
-CH 2 CH 2 -s-CH 3 ) to get a PN-Met M6 variant, such 
replacements of the »R» group can be carried out using 
25 published protocols known to those skilled in the art. 

EXAMPLE r 

Production r>f PecombJnar.1- p N -i y ay ,- aw «. g 
in Insect Cells Using a Baculovirus Rv^-ooe^ e ^ frn 

£jJ ^ Construction of m a ^j d expression v^tnr- 

In order to produce PN-l and/or PN-l variants in 
insect cells, the cDNA sequence must first be inserted 
into a suitable plasmid expression vector, such as 
PAC373. Appropriate restriction sites for this insertion 
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can be created by standard sit -directed mutagenesis 
procedures. The essential properties of a suitable 

aTcr^ rr or inoiude * **—"**>~i 

as the polyhedron gene promoter of pAC373, and flanking 
5 homologous sequences to direct recombination into the 
baculcvirus genome. A polyadenylation signal, such as 
the one from the polyhedron gene present in this plasmid 
vector, may or may not be necessary for expression of the 
recombinant gene. A marker gene such as the B- 
10 galactosidase gene of E. coli, juxtaposed to regulatory 
seouences including a transcriptional promoter and 
possibly a polyadenylation signal, may be included in the 
vector but is not essential for expression of a convect^d 

15 Creation of reno,nh4»=, nt h^wt^.,. 

A chimeric baculcvirus is created by homologous 
rec onbi tion between the expression piasn y 

the pn-1 target gene and wild type baculcvirus dna 

20 b^n? T. Wild ^ b9CUlOVi - S « ™ co-precipitated 
20 by the calcium phosphate technique and added to 

uninfected Spodoptera frugiperda (Sf9) insect cells 
Four to seven days following transection, cells will 
exhibit a cytopathic morphology and contain the nuclear 

25 Z ,r b ° dieS typiCally produced * viral infection. 
The cell-free culture media containing both wild type and 
recombinant virus is harvested. 

^ Identification anrt <«»o.^ on n , nY%1 ■ _ 

baculovirng» 

Clonal isolates of virus can be obtained from this 
30 co-transfection stock by plaque purification on Sf9 cell 
monolayers overlaid with agarose. Candidate plaques for 
analysis will be identified by a plaque morphology 
negative for occlusion bodies, if the expression plasmid 
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contains a marker gen such as 0-galactosidase, 
recombinant plaques will be indicated by the blue color 
produced from a chromogenic substrate such as 5-bromo- 
4-chloryl-3-indolyl-b-D-galactopyranoside (X-gal) in the 
5 agarose plating medium. Picked plaques will be used for 
inoculation of cells in multiwell dishes. The resulting 
cell lysates and infected cell supernatants can be 
evaluated for expression of recombinant PN-l, using 
standard activity or immunological assays. Positive 
10 wells may require additional rounds of plaque 

purification to obtain pure recombinant virus stocks free 
from wild type contamination. 

C.4. B atch production of PN-l ; 

Sf9 cells are adapted to growth in serum-free, low 

15 protein medium such as ExCell (j.r. Scientific). Cells 
are collected from suspension culture by gentle 
centrifugation and resuspended in fresh medium containing 
the viral inoculum at a concentration of ten million 
cells per ml., using a multiplicity of infection of one 

20 virus plaque forming unit per cell. After a period of 
two hours, the culture is diluted five fold with fresh 
medium and incubated two to three days. At the end of 
that time, the cells are pelleted by centrifugation and 
the conditioned medium harvested. PN-l is purified from 

25 the cell-free supernatant by standard means. 

Variants of PN-l may be created and produced in 
the same manner as described above. 

C-sJ-: — Characterization of insect cell derived pm-i . 

PN-l produced in insect cells using a baculovirus 
30 expression system is a glycosylated protein of 
approximate molecular weight of 42,000 kd. The 
N-terminal amino acid sequence is identical to that of 
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mature kalian cell pn-1, indicating correct processing 
of the sagnal sequence. The specific activity vs 
thrombin and association kinetics, including rate 
enhancement effect of heparin, are indistinguishable from 
authentic PN-l. 



EXAMPLE p 

Production of RecpTnHnant PN-i 1 n 
E- coli in inclusion Bodies ^ soluHio » , 

D»l Cloni ng of PN-i 
10 The cloning of pn-1 and expression has been 

described (McGrogan, et al., (i 98 8) Bio/Technology,. The 
gene for PN-1 was generated by PGR from the CHO 
expression vector using the following oligonucleotides- 
PNPCR-forward 5- TG.GAA.GGA.CAT^TG.AAC.TGG.CAT CTC 
15 PNPCR-reverse 5- TCT.TTT.GTA.TAC.TjGJ^JTgA.GGG.TTT GT 
generating an Ndel and Bell site, respectively. The 
resulting fragment was cut with Ndel and Bell and 
subcloned into pGEMEX-1 vector (Promega) . 

The pgemex E. coli expression vector contains 
20 three RNA polymerase promoters. The T7 promoter is 
positioned upstream from the gene 10 leader fragment. 

We removed the gene 10 region from pGEMEX, but 
retained the T7 rna polymerase binding site and Ndel and 
BamHI cloning sites. To accomplish this the Ndel site at 
25 3251 m pGEMEX was removed by partial Ndel digest 

followed by Klenow fill-in and relegation. This piasmid 
as referred to as pT7-NK. p T 7-NK was cut with Ndel and 
BamHI to remove the gene 10 fusion protein region. The 
linear vector was isolated and ligated with the PCR- 
30 generated PN-1 linear fragment, cut with Ndel and Bell 
described above. This piasmid is referred to as p T 7PN-l 
The correct sequence was confirmed by sequencing the 
entire coding region for PN-1. 
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The native signal sequence was removed using PCR 
and the following oligos: 

PCRMET. forward S'GAT.ATA.fiMl^.TCC.CAC.TTC.AAT.CCT.CTG 
PCRMET.reverse 5 •GGG.GGC.ACT.IGT i £GA i CCC.ACA.CCG GAA 
with an Ndel and Sail site, respectively. This generated 
a 690 base pair fragment which could replace the native 
signal and a portion of the amino terminus of PN-1 with a 
start codon (Met) and the amino terminus of pn-1. Again 
the correct sequence was confirmed by sequencing. The 
expression of the resulting protein is expected to be 
intracellular, either in inclusion bodies or as soluble 
protein. 



20 



D.2. M utagenesis of PN-1 

The plasmid pT7PNl has an fi ori for the 
15 production of single-stranded DNA. Thus p T 7PNl was 
transformed into the E. coli strain CJ236 for the 
production of ssDNA to be used as a template for site- 
directed mutagenesis according to the method of Kunkel 
(Kunkel, T.A. (1988) in Nucleic Acids and Molecular 
Biology (Eckstein, F. , Lilley, D.M.J. Eds.) Vol. 2, 
P. 124, Springer-Verlag, Berlin and Heidelberg). 

The general rationale for mutant generation is 
based upon four general methods. 

In the first method, single amino acid 
25 substitutions in the region of P4 to P4' are generated by 
site directed mutagenesis. m general, substitutions at 
the Pi site will have the most dramatic effects. 
However, substitutions at other residues within the 
active site region will give changes in association rate 
30 constants with serine proteases. 

In the second method, sequences found at the 
active site region of other serpins were grafted onto 
PN-l. a number of combinations must be created to 
determine how much of the sequence at the active site 
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must be changed to change the specificity and kinetics. 
The p 4 to P 4 » region is generally found to be most 
important, but amino acids residues outside this region 
can have pronounced affects on protease inhibition. 
5 in the third method, sequences which have been 

found to be particularly good substrates are added to or 
used to replace a sequence of PN-1. Prior to making 
these mutants, it was not clear if these changes would 
ruin the inhibitory effects of PN-l and turn PN-l from an 
10 inhibitor of proteolysis into a substrate, m fact, 
incorporation of Ala-Ala-Pro-Phe , a good substrate 
sequence for subtilisin, into PN-l results in a molecule 
which is cleaved particularly well by subtilisin. 
However, PN-l variants have now been obtained which are 
15 good inhibitors of mammalian serine proteases based upon 
this approach. 

In the fourth method, optimum inhibitor sequences 
can be generated by using a phage display system, since 
PN-l forms covalent interactions with the target 
protease, it is important that one is not selecting for 
mutants which bind more tightly than the parent PN-l 
molecule. Rather, one selects for PN-l variants which 
bind more rapidly to the target protease by allowing 
phage-displayed variant PN-l library to interact with the 
25 immobilized target protease for only short times. Thus, 
only rapid-binding variants will be selected. This is a 
novel application of the phage display system. 

The following oligonucleotides were used to 
generate a mutant specific for the protease shown at the 
30 end of the sequence: 

5'GCA.ATT.CTC.ATT.GCA.NN(G/C) .TCA.TCG.CCT.CCC [R345I, 
R345M, R345L, R345V] (elastase) , R345K (plasmin) , R345D, 

R345E 5'ATT.CTC.ATT.GCA.GTG.AGC.TCG.CCT.CCC.TG R345V 
(elastase) 
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5 'ATT. CTC. ATT. GCA. AGA. ATA. TCG. CCT. CCC. TGG S3461 
(Factor Xa) 

S'ATT.CTC.ATT.GCA.AGA.ACA.TCG.CCT.CCC.TCC S346T 
(Factor Xa, Cl-esterase) 

5 5 • ACA . ACT . GCA . ATT . CTG . GCT . GGA . AGA . TCA . TTG . AAT . CCC TGG 
TTT.ATA I343A;A344G;S347L;P348N (ATIII-liJce, thrombin' 
Factor Xa) ' 

5 • ACA . ACT . GCA . ATT . CTC . TTT . CCA . AGA . TCA . TCG . CCT . CCC 
I343F;A344P (FPR, thrombin) 

10 ACT . GCA . ATT. CTC . ATT . CCA . TTA . TCA . TCG . CAG . GTC. CGG . TTT. 

ATA.GTA.GAC A344P;R345L;P348Q;P349V; W350R (HCII- 

like) 

5'GAA.GAT.GGA.ACC.AAA.GCT.TCA.GAC.TTT.TTG.GCT.GAA.GGT 

GGC . GGT . GTA . AGA . TCA . TCG . CCT . CCC . TGG A336D;A337F- 

15 T338L;T339A;A340E;I341G;L342G;A344V (fibrinoaen- 

like, thrombin) 

5 • GCA . ACA . ACT . GCA . ATT . ATC . GAG . GGA . AGA . TCA . TCG . CCT 

L342I;I343E;A344G (Factor Xa) 
5 • ACA . ACT . GCA . ATT . CTC . GAG . CCA . GTA . TCA . TCG . CCT . CCC 
20 I343E;A344P;R345V (elastase, cathepsin G) 

5 ' ACT . GCA . ATT . CTC . ATT . GGA . AGA . TCA . TCG . CCT A344G (faster 
kinetics) 

S'ACT.GCA.ATT.CTC.ATT.CCA.AGa'.TCA.TCG.CCT A344P (faster 
kinetics) 

25 5'GCA.ACA.ACT.GCA.ATT.AGC.CCT.TTC.AGA.TCA.GTG.CAG.CCC 
TGG. TTT.ATA L342S;I343P;A344F;S347V;P348Q (high 
molecular weight kininogen-like; kallikrein) 

5 ' GCA . ACA . ACT . GCA . ATT . GCC . GCT . CCA . TTC . TCA . TTG . CCT . CCC . 

TGG. TTT L342A/I343A;A344P;R345F (cathepsin G) ' 

30 5'GCA.ACA.ACT.GCA.ATT.GCC.GCT.CCA.GTA.TCA.TCG.CCT.CCC. 

TGG . TTT L342A;I343A/A344P;R345L (elastase) 

5'GCA.ACA.ACT.GCA.ATT.GCC.GCT.CCA.CTA.TCA.TCG.CCT.CCC. 

TGG. TTT L342A;I343A;A344P;R345L (elastase) 
5 » GCA . ACA . ACT . GCA . ATT . GCC . G CT . CCA . ATA . TCA . TCG . CCT . CCC . 
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T6G.TTT L342A;I343A;A344P;R345I (elastase) 

Expression and PmHf f catinn of> P rotease Vovi „ 

Variants in K r ™]i 

JM109 (DE3) contains a chromosomal copy of the 
gene which codes for T7RNA polymerase under the control 
of the inducible lac promoter. JM109 (DE3) containing 
PT7PN-1 (or a variant of PN-i) was grown overnight in 
2xYT + 0.2% glucose + 100 mg/ml carbenicillin at 28-32'C. 
Low temperature and high nutrient containing solution is' 
helpful in generating productive innoculants. The 

inoculum was di i 1 *- - - ■ 

was ax* x.250 to i:suu ana grown to OD^-l 

in a shake flask or -50 in a fermentor at 26-37'C and 

induced with IPTG at O.l-i.o mM for 4-16 hours, The 

bacteria were collected by centrifugation, resuspended in 

15 10 mM TRIS, pH 8, 1 mM EDTA, and disrupted by high 

pressure homogenization. Inclusion bodies were collected 

by centrifugation, washed with 1 M NaCl, o.05% 

triethylamine, and the protein refolded from a 6 M 

guanidine solution by rapid dilution, pn-1 was purified 

20 by capture on Pasts sepharose and eluted with 0.6 M NaCl 

diluted to 0.25 M NaCl and passed over PastQ sepharose to 

remove endotoxin and recaptured on Fasts sepharose and 

eluted with 0.6 M NaCl or a gradient of 0.25 to 1 M NaCl. 

Alternatively, PN-1 can be generated in a soluble 

25 form within E. coli by adjusting the fermentation 

conditions. This procedure provides a greater yield of 

soluble PN-l as the fermentation temperature is decreased 

from 37 »C to 26'C with a concomitant loss in inclusion 

body material. This is quite an unexpected finding, 

30 since PN-1 is bactericidal when native PN-1 is added to 

E. coli. to purify soluble PN-l, the cell supernatant 

from the disruption step was clarified by centrifugation 

and filtration or by treatment with polycations such as 

polyethyleneimine or Biacryl*" followed by centrifugation 
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and filtration, and the soluble protein was purified as 
above. The generation of soluble material has many 
advantages: there is more certainty that the protein is 
correctly folded, there are no refolding steps, there is 
5 greater reproducibility from batch to batch. 

The production of PN-1 was about 50 mg per 
gram of cell paste. This corresponds to about 50 mg per 
liter of production at a cell density of 1 OD^ or up to 
2.5 grams of soluble PN-l per liter of fermentation. 
10 This represents a substantial advance in the state of the 
art of PN-l production. 

D.4. Activity assay for PN-1 variant-g 

Refolded or soluble protein was tested for 
capacity to inhibit thrombin in a standard assay. 
15 Briefly, serial 2-fold dilutions of PN-l variant were 

added to microtiter plate wells (50 Ml/well) , followed by 
50 Ml of a 30 ng/ml heparin solution, followed by l NIH 
unit of thrombin in 50 til. These were allowed to 
incubate at 25»c for 15 minutes. Residual thrombin 
20 activity was measured by the addition of 50 M l S-2238 
(Kabi Pharmaceuticals) at 0.625 mg/ml. PN-l variants 
were tested for their ability to inhibit urokinase using 
the substrate S-2444, plasmin using the substrate S-2390, 
tPA using the substrate S2288, Factor Xa using the 
25 substrate S-2222 or S-2765, kallikrein using the 
substrate S-2302, human neutrophil elastase using 
s-AAPV-pna (Sigma), cathepsin G using s-AAPF-pna (Sigma) 
in a similar manner, with or without the addition of 
heparin. 

30 The second-order rate association constants were 

determined for appropriate inhibitors-proteases 
combinations by combining egual-molar amounts of each 
protein (determined by titration as above) for various 
times from l second to 4 hours (as appropriate) and 
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following the activity loss. The t m was estimated from 
resulting curves. The was estimated according to the 

equation ln2/ [pn-1] x t I/2 . Alternatively, the apparent 
first order rate constant was determined from the slope 
of a plot of log (normalized activity) vs time. The 
second order rate constant was calculated by dividing the 
apparent first order rate constant by the PN-1 (or 
variant) concentration used. . 

EXAMPLE B 
Generation of a tf-pui ph^ Pra 
To generate this chimera, we first cloned the 
amino-terminal fragment of uPA by PCR using the 
oligonucleotides: ATF. forward 

5 • GGT . GAT . CAT . ATG . AGC . AAT . GAA . CTT . CAT . CAA 
15 ATP. REVERSE 

5 "TTT . AGG . ACG . CGT . CTG . CGC . CAT . CTG . CTC . AGT . CAT . G 
generating a Ndel and Mlul site respectively. The 
resulting fragment was cut with Ndel and Mlul and 
subcloned into P T7PN1 vector cut with the same enzymes to 
20 move the signal sequence. This plasmid is referred to as 
PT7ATF-PN1. The correct sequence was verified by 
automated sequencing using the T7 dye primer system 
(ABI) . 

To generate an even shorter version of ATF-PN1 
25 which retains the urokinase receptor binding region, 

ATF 48 -pni was made by introducing a Mlul site (underlined) 
by site directed mutagenesis at codon 48 using the 
following oligonucleotide: 

5 • CAC . TGT . G AA . ATA . G AT . AAC^GCG.T AA . AC C . TG C . T AT . G AG . 
30 The resulting plasmid was cut with Mlul to remove a 
300 bp segment and ligated. 
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Mutagenesis of ATF-PNi 

The plasmid pT7ATF-PNl has an fl ori for the 
production of single-stranded DNA. Thus P T7ATF-PN1 was 
transformed into the E. ooli strain CJ236 for the 
5 production of ssDNA to be used as a template for site- 
dxrected mutagenesis according to the method of KunJcel 
(Kunkel, T.A. (1988) in Nucleic Acids and Molecular 
Biology (Eckstein, F. , Lilley, D.M.J. Eds.) Vol. 2 
P- 124, Springer-Verlag, Berlin and Heidelberg) ' Jn 
10 addition, PN-l variants of interest can be subcloned into 
pT7 ATF-PN- 1 by using standard molecular biology 
techniques . 

Expression and PurifHn a «- j on of vtv-pmi 

The resultant plasaid was transformed into the 
E coli strain JM109 (DE3 ) , grown to OD^ - x in a shake 
flask or -50 in a fennentor, and induced with IPTG at 
0.1-1.0 mM for 4-16 hours at 26-37^. The bacteria were 
collected by centrifugation, resuspended in 10 mM TRis 
PH 8, 1 mM EDTA, and disrupted by high pressure 
20 homogenization. Inclusion bodies were collected by 

centrifugation, washed with 1 M NaCl, 0.05 % TEA, and the 
protein refolded from a 6 M guanidine solution. ATP-PNl 
was purified by capture on Fasts sepharose and eluted 
wxth 0.6 M NaCl, diluted to 0.25 M NaCl and passed over 
25 FastQ sepharose to remove endotoxin and recaptured on 
Fasts sepharose and eluted with 0.6 M NaCl or a gradient 
of 0.25 to 1 M NaCl. Alternatively, the cell supernatant 
from the disruption step was clarified by centrifugation 
and filtration, and the soluble protein was purified as 
3 0 above . 

Activity assay fn r ATF-PM1 

Refolded or soluble protein was tested for 
capacity to inhibit thrombin in a standard assay. 
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Briefly, serial 2-fold dilutions of ATF-PN1 were added to 
inicrotiter plate wells (50 m/well) , followed by 50 of 
a 30 fig /ml heparin solution, followed by i NIH unit of 

5 ftrrT". 50 Th6Se all ° Wed t0 lnc ^te at 25-C 

5 for is minutes. Residual thrombin activity is measured 
by the addition of 50 M i s-2238 (kabi Pharmaceuticals) at 
0.625 mg/ml. ATF-PN1 was tested for its ability to 
inhibit urokinase using the substrate S-2444, or plasmin 

10 the addition of heparin. 

Refolded or soluble protein was tested for the 
ability to bind to a soluble form of the urokinase 
receptor as measured by ELIZA, or the ability to inhibit 
the bxnding of urokinase to the soluble urokinase 

15 receptor. ATF-PN1 was also tested for its ability to 

inhibit UPA or DFP/PMFS treated uPA binding to cells such 
as HT1080, U937 , or ^ expressing upA receptor> 

EXAMPLE F 

Generation of cy gi-^ ne-PECy is^oH Prof .^„ e 

20 £ -1 Preparation of *f »i e iinirfn-Pire t>^^^ t 

Maleimido-PEG was prepared by mixing the 
following: 

1) 100 mg methoxypolyethylene amine (20 ^mol) 

(MW«5,000) 

25 2) 20 /imol 7-maleimidobutyric acid-N-hydroxy 

succinimide ester (GMBS) 
3) 2 ml loo mM Caps buffer, pH lo.o 

The amount of the components above (particularly i) and 
2)) and the volume indicated may be varied. For example 
30 it is permissible that the difference in the ratio of ' 
methoxypolyethylene amine to GMBS can vary by up to ten 
to 100-fold. Normally, about a two-fold excess of 1) 
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above to GMBS is preferred, while various buffers nay be 
substituted for the Caps buffer, it is important that 
Tris buffers are not used in this mixture, as Tris 
buffers will quench the reaction. The p H of the buffer 
5 used may vary considerably, although buffers having a P H 
of 10.0 are preferable over buffers having a pH of 8.0. 
in addition, the mixture above may contain up to 50% DMSO 
as a cosolvent. it is particularly important that the 
reaction mixture does not contain a reducing agent such 
10 as dithiothreitol (DTT) or 0-mercaptoethanol (0ME) . 

The mixture was incubated at 37»c for 30 minutes, 
although the reaction temperature may be as low as 4«c 
and the reaction time may be extended for up to one hour 
or more. After incubation, 12 mg of Tris free base or 
15 ethanolamine was added to the mixture to quench the NH 3 
moiety. This quenching step may be omitted. 

The reacted mixture is purified by elution through 
a PD-10 column (G-25) (BioRad) equilibrated with 20 mM 
Tris ( PH 7.4), 100 mM NaCl, and 0.1% Tween. The eluant 
20 was collected in 0.5 ml fractions and assayed for 

production of the Maleimido-PEG reagent by precipitation 
with 50% TCA. The resulting Maleimido-PEG (Mal-PEG) 
reagent is then used in to modify a selected protein by 
attachment of PEG to a cysteine residue (s) . 

25 — Reaction of Mal^i' mido-PEc: w j th Prnto{n 

Prior to reaction of the protein with the 
Maleimido PEG reagent, the purified protein was diluted 
to a concentration of about 200 M g/ml to l mg/ml in any 
suitable buffer which does not contain DTT or 0ME. 

30 Normally, the buffer was composed of 20 mM PIPES pH 6.75, 
0.6 M NaCl, and 1% glycerol. Approximately 10 nl to 
40 ,il of the diluted protein was used for the PEGylation 
reaction. The Maleimido-PEG reagent described in section 
F.l was diluted in a series of 2-fold dilutions using 
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10 Ml transfers of solution containing approximately i 
Hi Maleimido-PEG in a 10 „i volume of buffer composed of 
20 mM Tris pH 7.4, 0.1 M NaCI, and 0.01% Tween. The 
ratio of the maleimido-PEG to protein may be varied 
5 according to the preferred level of PEGylation of the 
protein desired. Up to 20-fold excess of maleimido-PEG 
to protein still provided for specific reaction of the 
reagent with cysteine residues of the protein. 

The protein and maleimido-PEG were incubated for 
10 one hour at room temperature, although this reaction may 
be performed at 4-c for longer periods of time, a sample 
of the reacted 

Sj c= oiiaxyiuea uy 5DS-PAGE to 

determine the minimal amount of maleimido-PEG reagent 
needed for complete coupling. 
15 The reaction described above may be used to 

determine the proper ratio of Maleimido-PEG to protein 
and then scaled up to produce commercially acceptable ' 
amounts of PEGylated protein. 



20 



F.3 Preparation »f f wi-i^.,^. r Bg Rean<an1 . 

(Maleimido) 2 -PEG is prepared by mixing the 
following: 



25 



30 



1) polyethylene bis [amine] 

2) 20 Mmol 7-naleimidobutyric acid-N-hydroxy 

succinimide ester (GMBS) 

3) 2 ml 100 mM Caps buffer, pH 10.0 

The amount of the components above (particularly 1) and 
2)) and the volume indicated may be varied. For example 
it is permissible that the difference in the ratio of io' 
and 2) can vary by up to 10 to 100 fold, although an 
excess of GMBS to 1) above is preferred, normally about a 
2-fold excess. While various buffers may be substituted 
for the caps buffer, it is important that Tris buffers 
are not used in this mixture, as Tris buffers will quench 
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th reaction. Th pH of the buffer used may vary 
considerably, although buffers having a pH of io o are 
preferable over buffers having a p H of 8.0. m addition 
the mixture above may contain up to 50% DMSO as a 
5 cosolvent. it is particularly important that the 

reaction mixture does not contain a reducing agent such 
as dithiothreitol (DTT) or 0-mercaptoethanol (0ME) . 

The mixture is incubated at 37«c for 30 minutes, 
although the reaction temperature may be as low as 4»c ' 
10 and the reaction time may be extended for up to one hour 
or more. After incubation, 12 mg of Tris free base or 
ethanolamine is added to the mixture to quench the NH 3 
moiety. This quenching step may be omitted. 

The reacted mixture is purified by elution through 
15 a pd-io column (G-25, (BioRad) equilibrated with 20 mM 
Tris ( P H 7.4), 100 mM NaCl, and 0.1% Tween. The eluant 
is collected in 0.5 ml fractions and production of 
(Maleimido) 2 -PEG is assayed by precipitation with 50% TCA 
The resulting (Maleimido) 2 -PEG (Mai -peg) reagent is then ' 
20 used in to modify a selected protein by attachment of peg 
to a cysteine residue (s) . 

.F.4 Reaction of fMalei^^ny- ^G vith p ^ a ^ 

Prior to reaction of the protein with the 
(Maleimido) 2 -PEG reagent, the purified protein (e.g. a 
25 pn-1 mutant containing a cysteine residue at position 99) 
as diluted to a concentration of about 200 ng/ml to 
1 mg/ml in any suitable buffer which does not contain DTT 
or 0ME. Normally, the buffer is composed of 20 mM pipes 
PH 6.75, 0.6 M NaCl, and 1% glycerol. Approximately 
30 io M l to 40 M l of the diluted protein is used for the 
PEGylation reaction. The (Maleimido) 2 -PEG reagent 
described in section F.i is diluted in a series of 2-fold 
dilutions using io M l transfers of solution containing 
approximately 1 m! (Maleimido) 2 -PEG in a 10 Ml volume of 
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buffer composed of 20 mM Tris pB 7.4, 0.1 M Ma ci, and 
0.01% Tween. The ratio of the maleimido-PEG to protein 
may be varied according to the preferred level of 
PEGylation of the protein desired. 
5 The protein and (Maleimido) 2 -PEG are incubated for 

one hour at room temperature, although this reaction may 
be performed at 4-c for longer periods of time. A saBple 
of the reacted mixture may be analyzed by SDS-PAGE to 
determine the minimal amount of (Maleimido) 2 -PEG reagent 
10 needed for complete coupling. 

The reaction described above may be used to 
determine the proper ratio of (Maleimido) 2 -PEG to protein 
and then scaled up to produce commercially acceptable 
amounts of PEGylated dimeric or multimeric proteins. 

EXAMPLR a 

Generation of CvstAino-P EGvlal-^ PM - ! VaHa nf e 

fTvoe TV V a r,- aB f.| 
G.l Selection of amino ^ id of PM _., ^ 

20 substitution bv ^f,^., 

PN-la and PN-10 contain N-glycosylation sites at 
ammo acid residue positions 99 and 140. Therefore 
these sites were selected for site-directed mutagenesis 
to replace the asparagine at one or both of these 
25 positions with cysteine. 

Three sites in PN-l were selected for replacement 
with cysteine on the basis of the presence of 
glycosylated residues at a corresponding site in a 
protein homologous to PN-l. Amino acid residue D192 was 
30 selected for replacement with cysteine since the proteins 
angiotensin and Rab 0RF1, each which are homologous to 
PN-l, are N-glycosylated at the amino acids corresponding 
to this residue in PN-l. Amino acid residue E23 0 was 
selected for replacement with cysteine since baboon a,- 
35 antitrypsin (a,-AT) , which is homologous to PN-l, is 
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10 



15 



glycosylated at the amino acid residue corresponding to 
this position in PN-1. Amino acid residue H252 was 
selected for replacement with cysteine since Oj- 
antiplasmin (a 2 -AP) , another protein homologous to pn-i, 
is glycosylated at the corresponding residue. 

Other amino acid residues were selected for 
replacement with cysteine on the basis of the position of 
the amino acid within the three-dimensional structure of 
PN-l as determined by X-ray crystallography (Figure 3) . 
The approximate position of the amino acid residues 
selected for cysteine substitution are indicated by their 
corresponding amino acid residue number. The particular 
amino acid residues identified for mutagenesis in the 
present example were selected on the basis of the 
apparent solvent-accessibility of the amino acid and the 
apparently few number of interactions with other amino 
acids in the protein. 

— Site-directed Mufc a ^ on of pkt-i to p^^< no 

The mutations selected above were generated in PN- 
20 la using site-directed mutagenesis as described in 
section D.2. Although PN-la was employed in these 
experiments, the same mutations in PN-10 are likely to 
provide the same effects as all the mutations were 
introduced into the region of amino acid sequence 
25 identity between these nearly identical proteins. Briefly, 
DNA encoding PN-1 was inserted into the plasmid pT7PNl 
which has an fl ori for the production of single-stranded 
DNA. This plasmid was then transformed into the E. coli 
strain CJ236 for the production of ssDNA to be used as a 
template for site-directed mutagenesis according to the 
method of Kunkel (Kunkel, t.A. (1988) in Nucleic Acids 
and Molecular Biology (Eckstein, P., Lilley, D.M.J. Eds.) 
Vol. 2, p. 124, Springer-Verlag, Berlin and Heidelberg). 



30 
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The oligonucleotides used to generate specific 
mutations within the PN-l coding region are as follows: 

H140C AAT GCA TGG GTT AAA AAC GAA ACC AGG GAT 
AAT GCA TGG GTT AAC *GC GAA ACC AGG GAT 
Hpal 

N99C GCC GTG TTT GTT AAG AAT GCC TCT GAA ATT 
GCC GTG TTT GTT AAC TGT GCC TCT GAA ATT 
Hpal 

P28C GTG AAG TCG AGG CCT CAT GAC AAC ATC GTG ATC 
10 GTG AAG TCG AGG TGC CAT GAC AAC ATC GTG ATC 

G52C CTG GGG GCG G AC TGC AG G ACC AAG AAG 

PstI 



N85C 

15 



GTC TCC AAG AAG AAT AAA GAC ATT GTG ACA GTG GCT 
GTC TCC AAG AAG TGC AAA GAT ATC GTG ACA GTG GCT 

EcoRV 



Q116C AAA GAT GTG TTC CAG TGT GAG GTC CGG 
AAA GAT GTG T TC TGC AG T GAG GTC CGG 
PstI 

N304C TCA TCA AAG GCA AAT TTT GCA AAA ATA ACA 
20 TCA TCA AAG GCA TGC TTT GCA AAA ATA ACA 

SphI 

SIC GAT ATA CAT ATG TCC CAC TTC AAT CCT CTG TCT CTC GAG 
GAT ATA CAT ATG TGC CAC TTC AAT CCC_TTA_^GT CTC GAG 
25 GAA CTA GGC Aflll 

GAA CTA GGC 

R63C ^ AAG ^ CTC GCC ATG GTG ATG &GA TAC GGC GTA AAT 
AAG AAG CAG CTC GCA ATG G TG ATG TGC TAC GGC GTA AAT 
Ncol destroyed 

30 E125C GTC CGG AAT GTG AAC TTT GAG GAT CCA GCC TCT 
GTC CGG AAT GTT AAC TTT TGC GAT CCA GCC TCT 

Hpal 

D147C AGG GAT ATG ATT GAC AAT CTG CTG TCC CCA GAT CTT ATT 
AGG GAT ATG ATT TGC AAT C TC TTA AG P CCA GAT CTT ATT 
J * Aflll 
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D192C TTC GTG GCA GCA GAC GGG AAA TCC TAT 
TTC GTG GCA GCA TGC GGG AAA TCC TAT 
SphI 

E230C CCC TAC CAC GGG GAA AGC ATC AGC ATG 
5 CCC TAC CAC G GC TGC AG C ATC AGC ATG 

PstI 

H252C GCC ATC ATC CCA CAC ATC AGC ACC AAG ACC ATA GAC 
GCC ATC ATC CCA TGT ATC AGT ACT AAG ACC ATA GAC 

Seal 

10 S2 63C ACC ATA GAC AGC TGG ATG AGC ATG GTC 

ACC ATA GA C AGT TG G ATG TGC ATC ATG GTC 
PvuII destroyed 

P2 67C AGC ATC ATG GTC CCC AAG AGG GTG CAG 
AGC ATC ATG GTC TGC A AA CGC GT G CAG 
15 Afl III 

D284C GCT GTA GCA CAA ACA GAT TTG AAG GAG CCG CTG 
GCT GTA GCA CAA ACA TGT TTA AA G GAG CCG CTG 

Dral 



20 



The mutants are named according to the single-letter code 
for the amino acid residue in the native protein, the 
number of the position of that amino acid within the 
amino acid sequence of PN-1, and the single-letter code 
for the amino acid residue substituted at that site. For 
example, the mutant SIC produces a PN-1 protein which has 
25 the serine at position 1 replaced by cysteine. The top 
sequence for each mutant above indicates the wild type 
PN-1 sequence, while the sequence below indicates the 
mutation introduced in the coding sequence of the mutant . 
The nucleotides in bold are changed relative to wild 
30 type. The codon which is double-underlined is the newly- 
introduced codon for cysteine. The underlined sequences 
in the mutated DNA sequence indicate a restriction enzyme 
site which is introduced into or removed from the 
nucleotide sequence of the mutant. Introduction or 
35 removal of these restriction sites do not alter the amino 



WO 95/11987 



PCT/US94/11624 



- 95 - 

acid sequence encoded at that site, but provide a means 
for screening clones containing DNA subjected to site- 
directed mutagenesis for incorporation of the 
oligonucleotide sequence into the PN-l coding sequence. 
5 Introduction of the desired mutation was confirmed by 
restriction enzyme analysis. 

Mutant PN-l proteins containing multiple cysteine- 
substituted residues were generated by introduction of a 
first mutation by site-directed mutagenesis as described. 
10 After confirmation of the insertion of the first mutation 
by restriction enzyme analysis, the DNA was subjected to 
a second round of site-directed mutagenesis using a 
different oligonucleotide. For example, the double 
mutant N99C;N140C was generated by site-directed 
15 mutagenesis with the N99C oligonucleotide and 

confirmation of the presence of the newly introduced Hpal 
site in the coding sequence. The N99C mutant DNA was 
then subjected to a second round of site-directed 
mutagenesis with the N140C oligonucleotide. Table G.5A 
20 below lists the single, double, and triple mutants 

generated using these techniques and the oligonucleotides 
described above. 

DNA encoding the mutant PN-l proteins were 
expressed and the expressed proteins purified as 
25 described in section D.3. 

G.3 Reaction of PN-l and PN-l Mutant* w ith MalP^i^-i,^ 
Reagent 

After purification of PN-l and the PN-l mutants 
described above, each protein was reacted with the 
30 Maleimido-PEG reagent described in P.i according to the 
general protocol of F.2. For example, the mutant 
N99C;N140C was cysteine-PEGylated using the following 
protocol. 
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Purified N99C;N140C protein was diluted to a 
concentration of about 200 fig/ml in 20 mM PIPES pH 6.75, 
0.6 M NaCi, 1% glycerol. Approximately 40 fil of the 
diluted protein (0.25 nmol) was used for the PEGylation 
5 reaction. The Maleimido-PEG reagent described in section 
F.l was diluted in a series of 2-fold dilutions using 
10 fil transfers starting from approximately 
2 Ml Maleimido-PEG in a 10 fil volume of buffer composed 
of 20 mM Tris pH 7.4, 0.1 M NaCi, and 0.01% Tween. This 
10 reaction contained a two-fold excess of the Maleimido-PEG 
reagent over that required for PEGylation of the number 
of cysteine sites in the PN-1. 

The N99C;N140C protein and maleimido-PEG mixtures 
were incubated for one hour at room temperature. A 
15 sample of each of the reacted mixtures was analyzed by 
SDS-PAGE. Analysis of this gel revealed that the band 
migrating at the relative molecular weight of unmodified 
N99C;N140C PN-1 disappeared as the ratio of Maleimido-PEG 
to protein increased. Accordingly, as the amount of 
20 unmodified N99C;N140C PN-1 in the sample disappeared with 
increasing Maleimido-PEG concentrations, distinct bands 
migrating at molecular weights of increasing intervals of 
approximately 5,000 MW appeared. Thus, reaction of the 
PN-1 variant produced distinct cysteine-PEGylated 
25 proteins containing increasing numbers of PEG units per 
protein molecule, up to 2 PEG per PN-1 molecule, the 
maximum number of cysteines available in the N99C;N140C 
PN-1 variant. 

Distinct bands representing proteins increasing in 
30 relative molecular weight by 5,000 MW intervals is 

evidence of the specificity of the Maleimido-PEG reaction 
for attachment of PEG to cysteine residues. If the 
reaction had resulted in PEGylation of residues other 
than cysteine, a smear of proteins would appear on the 
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gel, indicating the presence f proteins containing an 
infinite number of PEG moieties. 

A sample of the cysteine-PEGylated protein which 
was reacted at a ratio of 2:1 Maleimido-PEG to protein 
was tested for activity using the assay described in D.4. 
This sample, which primarily contains cysteine-PEGylated 
protein, retained at least 100% of the activity of 
unmodified PN-l. The specific activity of the PEGYlated 
proteins increased with an increasing amount of the 
Maleimido-PEG reagent present in the reaction mixture 
(Figure 4) . These data suggest that an increase in 
activity is often found upon increasing PEG modification, 
which may result from the increased solubility and/or 
activity of PEGylated PN-l. 

15 ^ — Generation of PEGvla ted pw-i Mutant iftHp rj 
Conventional Method fr omparative Examp le) 

In order to compare the results obtained above 
with the PEGylation methods known in the art, the 
N99C;N140C PN-l variant was PEGylated using a method 
20 similar to that described by Zalipsky in USPN 5,122,614, 
with the substitution of a paranitrophenol carbonate of' 
PEG for the N-succinimide carbonate of PEG used by 
Zalipsky as the activated carbonate. The protocol used 
was otherwise identical. Ratios ranging from 1:1 to 
25 100:1 of activated PEG to PN-l mutant (N99C;N140C) were 
used in the reactions. 

A sample of each of the reacted mixtures 
containing dilutions of the PEGylation reagent was 
analyzed by SDS-PAGE. Analysis of this gel revealed that 
30 the amount of protein migrating at the molecular weight 
of the unmodified PN-l variant decreased with increasing 
concentrations of the PEGylation reagent of Zalipsky used 
in the reaction. However, in contrast to the distinct 
bands generated using the method of the invention, a 
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sm ar of proteins of various, increasing molecular 
weights appeared as the unmodified protein disappeared. 
The PEGylated proteins produced by the conventional 
method contained various numbers of PEG moieties per 
5 protein molecule, suggesting that attachment of PEG was 
random and that any number of lysine residues to various 
positions were modified. 

The sample which contained various levels of PEG 
modification were tested for activity in the assay 
10 described in D.4. The data show that as the amount of 
the Maleimido-PEG reagent present in Maleimido- 
PEG/protein reaction mixture increased., the specific 
activity of the protein decreased (Figure 4). This 
suggests that increasing levels of PEG modification using 
15 the conventional method result in a decrease in the 
activity of the protein. 

G.5 Activity of Py^te ine-PFGyl a ted PW-i ^ PN .-, M „ f; , Btg 

The specific PN-1 mutants and cysteine-PEGylated 
mutants generated are shown in Table G.5A. Each of the 

20 PN-l site-directed mutant proteins, as well as wild type 
PN-i, were modified by cysteine-PEGylation using the 
protocol described in F.2. The activity of wild type PN- 
1, each of the PN-1 site-directed mutants, and the 
cysteine-PEGylated wild type and mutant proteins was 

25 determined using the assay described in D.4. 
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TABLE G.5A 




BEFOJ 


*E PEG 

I CAT I ON 


AFTER PEG MODIFICATION 


1 MUTANT 


SPECIFIC 
ACTIVITY 1 


REL TO 
WT 2 


ACTIVITY 


REL TO 
ACTIVITY 
BEFORE 
PEG 3 


REL TO 
WT 2 


WT 


0,50 


1 


0.017 


0.03 




N99C 


0.20 


0.40 


0.50 


2.5 


1.0 


N140C 


0.050 


0.10 


0.10 


2 


0.20 


N99C; 
N140C 


0.167 


.33 


0.33 


2 


0.75 


SIC 5 


0.10 


0.20 


0.25 


2.5 


0.50 


J R63C 5 


0.050 


0.10 


0.125 


2.5 


0. 25 


1 N85C 5 


0.033 


0.066 


0.040 


1.2 


0.080 


1 D147C 5 


0. 063 


0.125 


0.125 


2 


0.25 


1 D192C 5 


0.045 


0.091 


0.91 


2 


0.18 


1 E230C 5 


0.071 


0.14 


0.20 


2.8 


0.40 


H252C 5 


0.10 


0.20 


0.25 


2.5 


0.50 


J H252C 5 


0.10 


0.20 


0.25 


2.5 


0.50 


| N304C 5 


0.056 


0.11 


0.17 


3 


0.33 


C117S; 
C131S; 
C209S 


0.050 




0.50 







10 



15 



20 



25 



30 



1 Sp.Act is NIH units of thrombin inhibited per ua PN-l 
(variant) . ^ 

2 Activity relative to wild type is calculated by 
dividing the activity of wild type PN-l by the activity 
of the mutant. 2 

3 Activity of cysteine-PEGylated protein relative to 
activity of this protein before PEGylation is 
calculated by dividing the activity of the mutant 
before PEGylation by the activity of the mutant after 
PEGylation. 

4 Activity of cysteine-PEGylated PN-l is variable as 
modification of the naturally occurring Cys 209 
inhibits activity. 
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5 Th s mutants are in the N99C;N140C mutant background 
(i.e. these are triple mutants). 



5 



The mutations described here for PN-1 can be 
introduced into any serpin with the expectation of 
substantially similar effects due to the homology between 
the members of the serpin protein family. 



**** Test for Half-T.ifP. of Cvsteing -PEGVIa^ pw-T a „rt 

PN-1 Mllfran+g 

The circulating half-life of any protein can be 
10 measured by standard methods well known in the art. For 
example, radioactive PEG-modified protein is injected 
into a mouse, rat, or rabbit. At various times, blood is 
withdrawn and the amount of protein remaining in 
circulation is determined by scintillation counting. 
15 Alternatively, PEG-modified PN-1 is injected into a 
mouse, rat, or rabbit. At various times, blood is 
withdrawn and urokinase inhibitory activity is measured. 
In some cases, the amount of protein remaining in 
circulation can be measured with antibody reaction as in 
20 an ELIZA or sandwich ELIZA. 



Si2 — Administration o f Cvstaine-PEGvlated PN-1 

Cysteine-PEGylated PN-1 and/ or the cysteine- 
PEGylated PN-1 mutants described above may be used in the 
treatment of a variety of disease states for which PN-1 

25 is indicated as therapeutically useful. For example, the 
proteins may be incorporated into a bandage for dressing 
a wound as described in USPN 5,196,196, herein 
incorporated by reference with respect to the use (e.g. 
dosages and routes of administration) of PN-1 in wound 

30 dressings. Alternatively, cysteine-PEGylated PN-1 and/or 
cysteine-PEGylated PN-1 mutants may be incorporated as 
the active ingredient (s) in a pharmaceutical compositions 
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for treatment of inflammation and arthritis, as described 
in USPN 5,.20 6/ oi7 and USPN 5,326,562, each incorporated 
herein by reference with respect to the use (e.g. dosages 
and routes of administration) of pn-1 in treatment of 
such conditions. 



EXAMPLE H 

G eneration of r^Mnn-^^ ^ronn^H^ ^ 

.Selection of Amino Resic j npe fn „ 

Substitnl-irm 

" The amino acid sequence of erythropoietin (EPO) is 

as follows: 

M ™CP^LLLSI^SLPI^LPVLGAPPm J lCDSRVWRYl J LEAKEAE 50 
HITTGCAEHCSLNENITVPDTKVNFYAWKRMEVGQQAVEVWQGLALLSEA 100 
VLRGQALLVKSSQPWEPLQUIVDKAVSGLRSLTTLLRALGAQKEAISPPD 150 
15 AASAAPLRTITADTFRKLFRVYSNFLRGKLKLYTGEACRTGDR 194 

The first 27 amino acids of the protein (italicized, are 
the EPO signal sequence. The amino acid residues which 
are xn bold and underlined above (N24, N38, and N83, are 
sites of N-glycosylation in the native EPO protein 
20 These sites are thus selected for replacement with a 
cysteine residue, which is subsequently modified by 
PEGylation. 

^ — Site-directed Min- a rren esi g of ^ 

The complete nucleotide sequence which codes for 
25 the mature EPO protein is known in the art and available 
from GenBank. DNA encoding the mature EPO protein is 
cloned and subjected to site-directed mutagenesis as 
described in D.2. Oligonucleotides for replacement of 
residues N24, N38, and N83 with cysteine are as follows- 
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N24C: GCC AAG GAG GCC GAG TGT ATC ACG ACG GGC 
N38C: TGC AGC TTG AAT GAG TGT ATC ACT GTC CCA 
N83C: GCC CTG TTG GTC JGC TCT TCC CAG CCG 

The residues in bold and underlined indicate the 
5 nucleotides which are different relative to the wild type 

EPO DNA sequence and represent the cysteine codon to be 

introduced into the EPO amino acid sequence. 

The mutant EPO proteins which contain a cysteine 

residue at N24, N38, N83, or a combination of these site 
0 (e.g. double and triple mutants) are generated as 

described, expressed, and purified using techniques well 

known in the art. 



15 



20 



fi*- 3 - — Generation of Cvst eine-PEfiylated kpo a ^ 
Cvsteine-P EGvlated EPO Mutants 

Purified EPO mutants N24C, N38C, N83C, and mutants 
containing combinations of these mutations are subjected 
to cysteine-PEGylation using the protocol described in 
F.2. Samples of the reacted proteins are analyzed by 
SDS-PAGE to determine the extent of the PEGylation, as 
well as the minimum amount of the Maleimido-PEG reagent 
necessary to produce fully PEGylated protein. 

The activity of the cysteine-PEGylated wild type 
EPO, as well as the cysteine-PEGylated EPO mutants are 
tested using protocols known in the art. 

25 EXAMPLE I 

Genera tion of Cvsteine-PEGvlafceri 
Human Gro wth Hormone rhGHl 

it-i Selection of Amino Acid Residues for- Cysteine 

Substitution 

30 The nucleotide sequence, amino acid sequence, and 

the three-dimensional structure of human growth hormone 
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(hGH) are well known in th art (see, for example, 
Cunningham and Wells 1989 Science 244:1081-1085)/ m 
addition, the three-dimensional structure of hGH bound to 
its receptor is known (De Vos et al. 1992 Science 
5 255:306-312). Amino acid residues for replacement with 
cysteine are selected based upon the solvent- 
accessibility of the amino acid residue, the proximity of 
the residue to other amino acid residues with which it 
may interact, and the distance of the residue from 
10 regions of hGH which are known to be important for 
receptor binding (Cunningham and Wells 1989 Science 
244:1081-1085). 

— Site-di rected Mutagenesis of hftH 

Oligonucleotides for site-directed mutagenesis are 
15 designed so as to introduce a cysteine residue in place 
of the amino acid residue (s) selected above, site- 
directed mutagenesis is performed as described in D.2. 
The resulting hGH DNA is then inserted into an expression 
vector, and the resultant protein is expressed in E. coli 
20 or other suitable host. The resulting hGH mutant protein 
is then purified according to methods known in the art. 

ii3 — Generation of C ysteine-PEGvlafcgri v>rzu . 

The hGH mutant protein is then subjected to 
cysteine-PEGylation using the method outlined in F.2. A 
25 sample of a reacted mixture of Maleimido-PEG and hGH 
mutant protein is analyzed by SDS-PAGE to determine the 
optimal conditions for cysteine-PEGylation (e.g. the 
minimal amount of the Maleimido-PEG reagent necessary to 
provide the desired PEGylated hGH mutant protein) . 

The cysteine-PEgylated hGH protein is then tested 
for activity by assaying for the ability of the modified 
protein to bind to purified, truncated hGH receptor, as 
described in Cunningham and Wells (ibid.). 



30 
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EXAMPLE J 

Generate of Hemoglobin h Y ry ^ eine-Pirey i:.*^ 

Cross-Li nvs nq 
Hemoglobin is a tetrameric protein complex 
5 composed of two "a" chains and two »b» chains. The amino 
acid sequences of the »a« and »b» chains of hemoglobin, 
as well as the tetrameric complex of hemoglobin composed 
of 2 "a" and 2 «b» chains are well known. Appropriate 
ammo acid residues which are solvent-accessible and 
10 minimally contacted with other side chains are selected 
for site-directed mutagenesis to cysteine. The -a" and 
"b- chain mutants are then expressed, purified, and 
allowed to form a tetramer. The mutant tetrameric 
complex is then reacted with various levels of 
15 (Maleimido, 2 -PEG as described in F.4 above. This reaction 
can be carried out with very dilute hemoglobin levels to 
form intramolecular cross-links to stabilize the 
tetrameric form of hemoglobin, with a minimum number of 
intermodular cross-links. Alternatively, the reaction 
20 can be carried out a higher with a higher hemoglobin 
concentration, resulting in higher levels of 
intermodular cross-linking to stabilize an aggregate of 
hemoglobin molecules. 

While the present invention has been described 
25 with reference to specific protease nexin-1 variants and 
formulations containing such, it should be understood by 
those skilled in the art that various changes may be made 
and equivalence may be substituted without departing from 
the true spirit and scope of the invention, in addition, 
30 many modifications may be made to adapt a particular 
situation, material, excipient, PN-1 variant, process, 
process step or steps to the objective, spirit and scope 
of the invention. All such modifications are intended to 
be within the scope of the claims appended hereto. 
35 WHAT IS CLAIMED IS: 
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CLAIMS : _ _ 

1 1. A protease nexin-l variant wherein an amino 

2 acid residue at a position selected from the group 

3 consisting of P4, P3, P2, Pi, pi', P2 », P3 • , P4 . is 

4 replaced with a natural amino acid residue which is 

5 different from the amino acid residue naturally present 

6 at that position. 



l 

2 
3 
4 

5 



6 

7 
8 
9 
10 



2. The protease nexin-l variant of claim 1, 
wherein the variant has a different protease specificity 
as compared with protease nexin-l and/or an increased 
rate association constant with respect to a specific 
protease as compared with protease nexin-i. 



1 3. a protease nexin-l variant wherein amino acid 

2 residues at the active site of protease nexin-l are 

3 replaced with an equivalent number of active site amino 

4 acid residues of a serine protease inhibitor other than 

5 protease nexin-l. 

1 4. The variant of claim 3, wherein the serine 

2 protease inhibitor is selected from the group consisting 



of antithrombin III, heparin cof actor II, a-l- 
antitrypsin, o-l-protease inhibitor, plasminogen 
activator inhibitor I, II, & m, a-2-antiplasmin, 
kallikrein-binding protein, and Cl-inhibitor. 



1 5. The variant of claim 3, wherein amino acid 

2 residues at positions P4, P3, P2, Pi, P i • , P2 • , P3 . P4 

3 — • ■ ' 

4 

5 of: 



of the active site of protease nexin-l are replaced with 
an amino acid sequence selected from the group consisting 

£4 £3 21 E, P,J. P^l Pjl Pi > 

Val- Ser- Ala- Arg Met- Ala- Pro- Glu 

Met- Thr- Gly- Arg Thr- Gly- His- Gly 

Phe- Thr- Phe- Arg Ser- Ala- Arg- Leu 

He- Ala- Gly- Arg Ser- Leu- Asn- Pro 
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11 

12 

13 

14 

15 

16 

17 
18 
19 

20 

1 6 

2 



3 
4 
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Ala- Met- S r- Arg Met- Ser- Leu- Ser 

Ser- Val- Ala- Arg Thr- Leu- Leu- Val 

He- Leu- Ser- Arg Arg- Thr- Ser- Leu 

Phe- Arg- He- Leu Ser- Arg- Arg- Thr 

Ala- He- Pro- Met Ser- He- Pro- Pro 

Glu- Lys- Ala- Trp Ser- Lys- Tyr- Gin 

Leu- Leu- Ser- Ala Leu- Val- Glu- Thr 

He- Thr- Leu- Leu Ser- Ala- Leu- Val 

Phe- Met- Pro- Leu Ser- Thr- Glu- Val 

Met- Thr- Gly- Arg Thr- Gly- His- Gly. 



A protease nexin-l variant wherein three or 
more amino acid residues of the active site of protease 
nexin-1 are replaced with different amino acid residues 
which comprise a substrate sequence specific for a given 



5 protease. 

1 7. The variant of claim 6, wherein the given 

2 protease is selected from the group consisting of 
3 

4 

5 



elastase, cathepsin G, Cl-esterase, thrombin, kallikrein, 
and Factor Xa, Factor IXa, Factor xia, Factor Xlla, 
Factor Villa, Factor V , Activated Protein C, trypsin, 



6 chymotrypsin. 



1 8. A protein or portion thereof containing three 

2 or more amino acids, at least one of which is cysteine, 

3 wherein polyethylene glycol is covalently bound to a tnio 

4 group of the cysteine. 
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l 9. A modified protein comprised of the amino 

acid sequence of a naturally occurring protein which 



6 
7 
8 



sequence includes at least one cysteine residue wherein 
the modification comprises the coupling of polyethylene 
glycol to a cysteine residue of the protein. 



1 10. A method of coupling polyethylene glycol to a 

2 protein comprising the steps of: 

3 identifying a protein of interest and 

4 determining a site for coupling of polyethylene glycol to 

5 the protein; 

coupling polyethylene glycol to the protein 
at said site wherein the polyethylene glycol is 
covalently bound to a thio group of a cysteine residue of 
the protein. 



11. The method of claim 10, wherein the cysteine 
residue is naturally present in the protein. 

12. The method of claim 10, wherein the protein 
is altered to include a cysteine residue not normally 
present and the polyethylene glycol is covalently bound 
to the added cysteine residue. 

13. The method of claim 12 wherein the site for 
coupling of polyethylene glycol is within a solvent 
accessible region of the protein. 

14. The method of claim 12 wherein the protein is 
altered to include the cysteine residue at a site of 
glycosylation. 

15. The method of claim 12, wherein the protein 
is altered to include multiple cysteine residues. 
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1 16. The method of claim 10 wherein said 

2 polyethylene glycol comprises at least two 

3 protein-reactive moieties. 



1 17* The method of claim 10, wherein said 

2 polyethylene glycol is between 200 and 10,000 molecular 

3 weight. 

1 18. The method of claim 10, wherein the protein 

2 of interest is protease nexin-1. 



1 19, The method of claim 10, wherein the protein 

2 of interest is a protease nexin-1 variant, said variant 

3 having an amino acid residue at a position selected from 

4 the group consisting of P 4 , P 3 , P 2 , P,, p,', P 2 ' , p 3 ' and 

5 P 4 ' replaced with a natural amino acid residue which is 

6 different from the amino acid residue naturally present 

7 at that position. 

1 20. The method of claim 10, wherein the protein 

2 of interest is a protease nexin-1 variant wherein at 

3 least one amino acid residue at the active site of 

4 protease nexin-1 is replaced with an equivalent number of 

5 active site amino acid residues of a serine protease 

6 inhibitor _other than protease nexin-1. 

1 21. The method of claim 10, wherein the protein 

2 of interest is a protease nexin-1 variant wherein three 

3 or more amino acid residues of the active site of 

4 protease nexin-1 are replaced with different amino acid 

5 residues which comprise a substrate sequence specific for 

6 a given protease. 



WO 95/11987 



PCT/US94/U624 



- 109 - 



1 22. A modified protease nexin-l protein comprised 

2 of the amino acid sequence of naturally occurring 
protease nexin-l protein which sequence includes at least 
one cysteine residue, wherein the modification comprises 
the coupling of polyethylene glycol to a cysteine residue 



6 of the protein. 



1 23. A modified protease nexin-l variant comprised 

2 of the amino acid sequence of protease nexin-l, wherein 

3 an amino acid residue at a position selected from the 

4 group consisting of P 4 , P 3 , P 2 , P „ p,/, Pj , , p 3 , and p<# . g 

5 replaced with a natural amino acid residue which is 
different from the amino acid residue naturally present 
at that position, and which sequence includes at least 
one cysteine residue, wherein the modification comprises 
the coupling of polyethylene glycol to a cysteine residue 

10 of the protein. 

1 24. The modified protease nexin-l variant protein 

2 of claim 23, wherein the variant has a different protease 

3 specificity as compared with protease nexin-l and/or an 

4 increased rate association constant with respect to a 
specific protease as compared with protease nexin-l. 



5 



1 25. A modified protease nexin-l variant wherein 

2 amino acid residues at the active site of protease nexin- 

3 1 are replaced with an equivalent number of active site 

4 amino acid residues of a serine protease inhibitor other 
than protease nexin-l, and the amino acid sequence of the 
variant protein includes at least one cysteine residue, 
wherein the modification comprises the coupling of 
polyethylene glycol to a cysteine residue of the protein. 



WO 95/11987 



PCT/US94/11624 



• 110 - 



1 
2 
3 
4 

5 



1 

2 



26. The modified variant of claim 25, wherein the 
serine protease inhibitor is selected from the group 
consisting of antithrombin III, heparin cof actor II, a -l- 
protease inhibitor, plasminogen activator inhibitor I, 
II, & III, a-2-antiplasmin, kallikrein-binding protein, 



6 and Cl-inhibitor. 



27. The modified variant of claim 25, wherein 
amino acid residues at positions P 4 , P 3 , p 2 , Pj , Pj , f ^ f 



3 P 3 ' and P 4 ' of the active site of protease nexin-l are 

4 replaced with an amino acid sequence selected from the 



group i^unsisting of: 



6 


24 


£3 


£2 


£j 


£il 


£2! 




£4! 


7 


Val- 


Ser- 


Ala- 


Arg 


Met- 


Ala- 


Pro- 


Glu 


8 


Met- 


Thr- 


Gly- 


Arg 


Thr- 


Gly- 


His- 


Gly 


9 


Phe- 


Thr- 


Phe- 


Arg 


Ser- 


Ala- 


Arg- 


Leu 


10 


Ile- 


Ala- 


Gly- 


Arg 


Ser- 


Leu- 


Asn- 


Pro 


11 


Ala- 


Met- 


Ser- 


Arg 


Met- 


Ser- 


Leu- 


Ser 


12 


Ser- 


Val- 


Ala- 


Arg 


Thr- 


Leu- 


Leu- 


Val 


13 


Ile- 


Leu- 


Ser- 


Arg 


Arg- 


Thr- 


Ser- 


Leu 


14 


Phe- 


Arg- 


Ile- 


Leu 


Ser- 


Arg- 


Arg- 


Thr 


15 


Ala- 


Ile- 


Pro- 


Met 


Ser- 


Ile- 


Pro- 


Pro 


16 


Glu- 


Lys- 


Ala- 


Trp 


Ser- 


Lys- 


Tyr- 


Gin 


17 


Leu- 


Leu- 


Ser- 


Ala 


Leu- 


Val- 


Glu- 


Thr 


18 


Ile- 


Thr- 


Leu- 


Leu 


Ser- 


Ala- 


Leu- 


Val 


19 


Phe- 


Met- 


Pro- 


Leu 


Ser- 


Thr- 


Glu- 


Val 


20 


Met- 


Thr- 


Gly- 


Arg 


Thr- 


Gly- 


His- 


Gly 



21 and the variant protein contains at least one cysteine 

22 residue, wherein the modification comprises the coupling 

23 of polyethylene glycol to a cysteine residue of the 

24 protein. 
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28. A modified prot ase n xin-1 variant wherein 
three or more amino acid residues of the active site of 
protease nexin-1 are replaced with different amino acid 
residues which comprise a substrate sequence specific for 
a given protease and the amino acid sequence of the 
variant protein includes at least one cysteine residue 
wherein the modification comprises the coupling of 
polyethylene glycol to a cysteine residue of the protein. 

29. The modified protease nexin-1 variant of 
claim 28, wherein the given protease is selected from the 
group consisting of elastase, cathepsin G, Cl-esterase, 
thrombin, kallikrein, Factor Xa, Factor IXa, Factor 
XXIa, Factor Villa, Factor V , Activated Protein C, 
trypsin, and chymotrypsin. 

30. A compound having the following general 
structural formula: 

Rj-S-PEG-S-R 2 

wherein R x and R 2 are independently each an amino acid 
sequence, each S is a thio group of a cysteine residue of 
each of R x and R^ and PEG is polyethylene glycol. 

31. The compound of claim 30, wherein R, and R 2 
each independently comprise from about 6 to 1,000 amino 
acids. 

32. The compound of claim 30, wherein said 
polyethylene glycol is from 200 to 10,000 molecular 
weight. 

33. The compound of claim 30, wherein in R, and 
R 2 are the same. 
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1 34 • T1 »e compound of claim 30, wherein R, is 

2 hemoglobin a chain and R 2 is hemoglobin b chain. 

1 35. A compound having the following general 

2 formula: 

3 R, - (S - peg - s - R l ) D 

4 wherein R, and R 2 are independently each amino acid 

5 sequences, S is a thio group of a cysteine residue of 

6 each of R, and Rj, and PEG is polyethylene glycol. 

1 36. The compound of claim 35, wherein said 

2 polyethylene glycol is from 200 to 10,000 molecular 

3 weight. 

1 37. The compound of claim 35, wherein R, and R 2 

are the same. 

38. The compound of claim 37, wherein R, is 
hemoglobin a chain and R 2 is hemoglobin b chain. 

39. A DNA sequence encoding the protease nexin-1 
variant of claim l. 



40. A DNA sequence encoding the protease nexin-1 
variant of claim 2. 



41. A DNA sequence encoding the protease nexin-1 
variant of claim 4. 



42. A DNA sequence encoding the protease nexin-1 
variant of claim 5. 



43. A DNA sequence encoding the chimeric protease 
of claim 8. 
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1 44. A pharmaceutical composition, comprising: 

2 a pharmaceutical^ acceptable carrier; and 

3 a variant of protease nexin-l as claimed in 

4 claim 1. 

1 45. A pharmaceutical composition, comprising: 

2 a pharmaceutical^ acceptable carrier; and 

3 a variant of protease nexin-l as claimed ii 

4 claim 2. 



46. a pharmaceutical composition, comprising: 
a phannaceutically acceptable carrier; and 
a variant of protease nexin-l as claimed i] 

claim 4. 



47. a pharmaceutical composition, comprising: 
a pharmaceutical^ acceptable carrier; and 
a variant of protease nexin-l as claimed in 

claim 5. 



48. a method of producing a variant protein, 
comprising: 

connecting DNA encoding amino acids of a 
receptor binding region of a first naturally occurring 
protein with DNA encoding amino acids of a second protein 
or a biologically active portion thereof which is 
different from the first protein; and 

expressing the DNA in a suitable host. 

49. The variant of claim 48, wherein the second 
protein is a variant of PN-1 wherein an amino acid 
residue at a position selected from the group consisting 
of P4, P3, P2, Pi, Pi', p 2 ', P3 ', p 4 ' is replaced with a 
natural amino acid residue which is different from the 
amino acid residue naturally present at that position. 
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50. A method as in claim 48 wherein the receptor 
binding region is from a first protein selected from the 
group consisting of urokinase, tPA, Factor IX, Factor X, 
Protein C, Epidermal growth factor and EGF-like domains. 

51. A method as in claim 50 wherein the receptor- 
binding region is an amino terminal fragment of 
urokinase. 

52. A method as in claim 51 where the receptor- 
binding region is an amino terminal fragment of urokinase 

Aiiuj.uuj.ny cunxiiu d^lQS 1"1JD Oj7 J. — 8 / • 

53. A pharmaceutical composition, comprising: 
a pharmaceutically acceptable carrier; and 
a protein or portion thereof as claimed in 

claim 8. 
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